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PRECEDING  FAQ!  BLANK* NOT  TllMB) 

INTRODUCTION \ 

Under  contract  MDA903-77-C-0366 , General  Research  Corporation 
(GRC)  has  accomplished  a series  of  tasks  to^enhance  and  extend  the  capa- 
bilities of  the  ELIM-COMPLIP  manpower  programming  system.  The  resulting 
enhanced  system  has  been  designated  as  ELIM-IV/C0MPLIP-G3 . 

This  report  reviews  the  objectives  of  this  study  and  the  technical 
foundations  of  the  efforts  undertaken  by  GRC  to  accomplish  the  tasks 
required  to  satisfy  these  objectives. 

OBJECTIVES  OF  THE  TASKS 


Task  1 - Second-Timer  Aging 

Expand  the  Inventory  Projection  Module  to  include  a two-dimensional 
array  for  the  First-Term  Second  Timers  so  that  this  population  is  seg- 
mented by  months  to  ETS  and  months  of  service.  Modify  the  Inventory 
Projection  Module  so  that  loss  rates  which  are  generated  by  the  system 
are  properly  applied  to  these  two-dimensional  arrays.  Modify  the 
Data  Processor  and  Factor  Development  Modules  so  that  loss  rates  applicable 
to  elements  of  these  two-dimensional  population  arrays  are  properly 
generated,  similar  to  those  currently  generated  for  the  remaining 
population  arrays. 

Task  2 - C-Group  Expansion 

Expand  the  current  number  of  characteristic  groups  for  non-prior 
service  (NPS)  gain  projections  from  4 to  a possible  maximum  of  40. 

The  increased  number  of  characteristic  groups  would  allow  the  user  to 
group  on  enlistment  terms  in  addition  to  groupings  on  sex,  education, 
and  mental  category.  The  period  for  which  loss  data  relevant  to  these 
characteristics  are  developed  and  retained  is  to  be  expanded  from  21 
to  55  months.  Each  of  the  characteristic  groups  is  to  include  the  fol- 
lowing loss  categories:  Expeditious  Discharge,  Trainee  Discharge,  Other 
Adverse,  Physical  Disqualification,  Research,  Other,  Dropped  from  Mili- 
tary Control,  Immediate  Reenlistment,  ETS,  and  Extension.  Information 
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on  marital  status,  number  of  dependents,  and  other  variables  to  be  selec- 
ted by  the  user  will  be  stored  on  the  new  data  base  for  retrieval  pur- 
poses. The  user  will  also  be  provided  the  capability  to  vary  the  number 
of  characteristic  groups  used  in  a forecast. 

Task  3 - User-Defined  Quality  Classes 

Provide  program  specifications  to  USAMSSA  which  would  allow  a 
simplified  user  selection  of  the  data  from  the  large  non-prior  service 
cohort  file  when  the  large  numeric  data  base  is  being  developed.  USAMSSA 
will  implement  the  specifications  and  bring  the  programs  on  line.  The 
result  will  be  the  capability  to  create  the  large  numeric  data  base 
from  scratch  by  creating  an  all-months  version  of  file  MT63A  for  input 
into  Program  QP8. 

Task  k - Non-Prior  Army  Service  Gains 

Provide  the  capability  to  automatically  pass  Non-Prior  Army  Ser- 
vice Gains  data  via  throughput  files  to  the  Report  Generator  Module 
(RGM)  and  allow  the  user  to  directly  input  this  gain  category  projection 
into  the  Inventory  Projection  Module  (IPM). 

Task  5 - Career  Force  Optimization 


Develop  the  capability  to  optimize  the  various  types  of  enlisted 
gains  to  the  career  population  category  in  order  to  achieve  and  main- 
tain a designated  career  strength.  In  addition,  develop  the  capabil- 
ity to  optimize  simultaneously  the  career  strength  by  years  of  service 
in  order  to  achieve  a user-specified  years -of-service  profile.  This 
optimization  should  take  place  within  the  linear  programming  portion 
of  the  ELIM-COMPLIP  model  and  allow  a wide  range  of  analyst  influence. 
During  optimization  the  appropriate  internally  generated  loss  rates 
must  be  applied  to  the  various  population  elements.  The  years-of-service 
array  should  be  broken  out  individually  (by  each  year)  for  at  least 
the  first  20  years  of  service. 


In  order  to  optimize  the  YOS  distribution  an  objective  function 
will  be  added  to  COMPLIP  which  minimizes  the  weighted  sum  of  the  devi- 
ations of  the  actual  from  the  planned  career  force  strength,  by  YOS, 


2 


subject  to  user-specified  constraints  on  total  strength,  the  supply 
of  UPS  enlistees,  PS  enlistees  by  YOS,  and  the  first-term  reenlistment 
gains  by  years  of  service. 

When  the  user  selects  the  career  force  optimization  mode,  COMPLIP 
will  determine  the  first-term  reenlistments  which  minimize  the  above 
objective  function.  Then,  new  reenlistment  factors  will  be  computed 
from  the  optimal  reenlistment  gains.  A second  pass  through  the  IPM 
will  apportion  the  optimum  reenlistment  gains  by  C-group  and  compute 
careerist  losses. 

Task  6 - First-Term  Tracking 

Develop  the  capability  to  track  and  observe  all  enlisted  person- 
nel, including  extendees,  through  their  initial  enlistment  until  they 
reenlist  or  separate.  This  is  to  provide  a more  accurate  accounting 
of  First  and  Career  Reenlistments.  In  the  existing  ELIM-COMPLIP  sys- 
tem, when  First  Termers  with  enlistments  of  4 to  6 years  reenlist  after 
36  months  of  service  they  are  counted  as  Career  Reenlistments  as  opposed 
to  First  Reenlistments. 

Task  7 - First  Term  to  Careerist  Transitions 

Develop  the  capability  to  display  by  month  and  year  the  dynamic 
flow  of  personnel  as  they  age  from  the  First  Term  to  the  Career  popul- 
ation category.  The  objective  of  this  and  of  Task  6 is  to  provide  the 
analyst  with  the  information  needed  to  analyze,  project,  and  control 
the  career  strength. 

Task  8 - Loss  Rate  Projection  Methods 

Develop  the  capability  to  use  where  applicable  a least  squares 
technique  for  fitting  an  exponential  equation  to  historical  data  and 
then  use  that  equation  for  loss  projection.  The  Factor  Development 
module  should  be  modified  and  provided  with  extensive  but  simple-to- 
apply  user  controls  on  the  least  squares  coefficients. 

In  addition,  other  statistical  techniques  as  well  as  those  used 
in  the  NPS  Gains  Module  are  to  be  examined  for  possible  inclusion  in 
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Task  11  - Key  Word  Processor 

Develop  a key  word  processor  to  create  the  40  characteristic  groups 
for  input  to  the  ELIM-COMPLIP  system  from  the  cohort  file  developed 
in  Task  10.  It  should  be  based  upon  logical  selection  criteria  using 
an  English  language  type  command  structure  to  simplify  user  control 
and  TSO  operations. 

Task  12  - Cohort  Targeting  Routine 

Examine  the  feasibility  and  acceptability  of  a gains  cohort  loss 
analysis  and  provide  ELIM  with  the  capability  of  tracing  loss  rates 
for  yearly  cohorts  and  targeting  for  their  loss  projections.  The  result 
of  this  type  of  analysis  should  be  more  accurate  loss  projections  and 
the  ability  to  distinguish  yearly  gain  cohorts  and  their  related  losses 
from  one  another.  The  user  should  have  the  capability  to  place  constraints 
on  total  losses  for  any  given  yearly  cohort  group.  Develop  graphs  of 
the  loss  patterns  of  the  individual  yearly  cohorts  as  well  as  summary 
graphs. 

In  this  task,  GRC  is  to  develop  a system  to  permit  the  user  to 
modify  projected  loss  rates  for  first  timers  by  accession  cohort.  The 
system  is  to  operate  independently  of  the  ELIM-COMPLIP  production  flow. 

It  will  accept  as  input  the  standard  loss  rate  projections  by  C-group 
by  month  of  service;  construct  accession  cohort  loss  curves  by  month 
of  service;  accept  user  modifications  to  those  curves;  and,  when  order- 
ed, revise  the  standard  projected  loss  rates  in  accordance  with  the 
results  of  the  users'  modifications  of  the  cohort  loss  curves.  The 
system  will  operate  either  interactively  or  in  a batch  mode. 

Task  13  - NPS  Gains  Forecasting 

Modify  the  existing  NPS  Gains  Module  to  provide  the  data  neces- 
sary to  use  the  expanded  number  of  characteristic  groups  provided  by 
Task  2.  This  requires  the  capability  to  provide  NPS  gair  forecasts 
for  the  expanded  number  of  characteristic  groups  as  well  as  projections 
of  seasonality  patterns  and  availability  constraints  of  supply-limited 
groups. 
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The  NPS  Gains  Module  was  created  as  part  of  the  ELIM-III/COMPLIP- 
G2  development  effort.  It  provides  COMPLIP  with  one-year-ahead  month- 
ly forecasts  of  accessions  by  C-group  using  a historical  data  base  de- 
rived from  the  monthly  transaction  tapes  and  a modified  least  squares 
time-series  methodology  with  nonlinear  deseasonaliz ing.  It  is  capable 
of  analyzing,  graphing,  and  projecting  gains  by  month  for  any  of  the 
150  accession  groups  in  the  Large  Numeric  Data  Base.  Since  only  four 
C-groups  can  be  handled  in  ELIM-III/COMPLIP-G2 , the  NPS  Gains  Module 
always  requires  that  the  groups  analyzed  and  projected  be  aggregated 
up  to  the  maximum  of  four  C-groups  defined  in  the  Small  Numeric  Data 
Base. 


With  the  expansion  of  the  C-groups  and  the  new  data  base  planned 
for  current  development,  changes  will  be  required  in  the  NPS  Gains  Module. 
For  the  most  part  these  changes  will  be  only  those  needed  because  of 
the  C-group  and  data  base  changes.  A few  small  enhancements  will,  how- 
ever, also  be  accomplished.  The  one -year-ahead  restriction  will  be 
relaxed  so  that  projections  can  cover  the  full  COMPLIP  projection  time- 
frame.  Projected  gains  by  C-group  will  be  available  to  the  IPM  as  well 
as  to  COMPLIP.  One  or  more  new  subroutines  will  be  provided  to  permit 
user  modifications  of  the  projections  similar  to  those  available  for 
modifying  loss  rate  projections. 

Task  14  - Matrix  Generator  Enhancements 

Develop  the  capability  within  the  matrix  generator  to  express 
linkages  between  the  various  characteristic  groups  as  ratios  or  per- 
centages of  each  other  (e.g.,  Mental-Category-IV  High-School  Diploma- 
Graduate  equals  10Z  of  all  High-School-Diploma-Graduate  Accessions  dur- 
ing a particular  projection  period).  Also  develop  the  capability  to 
equate  one  of  the  characteristic  groups  to  WACs.  Accommodate  the  spe- 
cial constraints  of  the  budget  year  by  developing  different  capabilities 
for  the  budget  year  and  the  remaining  years. 

The  existing  ELIM-COMPLIP  system  is  to  be  modified  by  GRC  to  pro- 
vide the  user  with  the  capability  within  the  matrix  generator  to  specify 
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the  accession  level  of  first-term  enlistments  of  one  characteristic 
group  as  a fraction  of  the  accession  level  of  another  characteristic 
group  and  with  the  capability  to  equate  one  of  the  characteristic  groups 
to  WACs  with  user-supplied  accession  levels. 

Task  15  - Support  and  Training 

Provide  assistance  in  adapting  all  related  computer  programs  to 
the  designated  government  computer  and  assist  in  training  users. 

IMPACT  OF  THE  TASKS  ON  THE  SYSTEM 


Many  of  these  tasks  address  needs  of  particular  modules  of  the 
system  and  have  little,  if  any,  impact  on  the  other  parts  of  the  sys- 
tem. Technical  discussions  of  the  accomplishments  of  such  tasks  are 
provided  in  separate  appendixes  to  this  report.  On  the  other  hand, 
some  tasks  required  significant  modifications  to  most,  if  not  all,  parts 
of  the  system.  These  tasks  include: 


• 

Task 

1: 

Second-Timer  Aging 

• 

Task 

2: 

C-Group  Expansion 

• 

Task 

6: 

First-Term  Tracking 

• 

Task 

7: 

First-Term  to  Careerist  Transitions 

The  changes  in  system  design  and  operation  required  to  accomplish 
these  tasks  are  discussed  in  the  following. 

The  changes  in  the  inventory  projection  methodology  and  data  arrays 
described  here  enable  ELIM-IV  to  satisfy  the  requirements  of  several 
of  the  tasks  described  above.  All  personnel,  including  extendees,  are 
tracked  through  their  entire  initial  enlistments  until  they  either  re- 
enlist or  separate.  The  nunfrer  of  C-groups  is  expanded  so  that  term 
of  service  can  be  used  as  a classification  characteristic.  Provisions 
have  been  made  for  operating  on  up  to  40  C-groups.  Personnel  are  pro- 
jected by  C-groups  through  the  first  55  months  of  service. 
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The  inventory  projection  procedures  can  be  divided  into  those 
used  for  projecting  populations — whose  strength  levels  are  not  deter- 
mined by  COMPLIP,  and  those  used  for  projecting  rates — for  segments 
of  the  inventory  whose  strength  levels  are  determined  by  COMPLIP. 

Projection  of  Populations 

To  handle  the  projection  of  many  more  C-groups  for  many  more  months, 
the  FT I array  has  been  expanded  and  the  Second-Timer  (STI)  and  careerist 
arrays  have  been  redefined.  The  ELIM-II/III  array  used  for  4 C-groups 
and  21  months  of  service  is  replaced.  The  three  area  matrices  (CONUS, 
Short-Tour,  and  Long-Tour)  and  the  AUS  matrix  are  unchanged. 

Changes  to  FTI  Array 

The  most  notable  change  is  in  the  C-group  arrays.  These  arrays 
in  ELIM-II/III  are  two-dimensional,  by  C-group  (CG  * 1,2, 3, 4)  and 
month  of  service  (m  “ 1,2. .20,21).  A third  dimension  has  been  added 
to  this  array  to  accommodate  the  term  of  enlistment.  The  summary  num- 
bers across  C-groups  are  to  be  retained  in  the  non-qualitative  First- 
Timer  (FTI)  arrays.  These  are  not  merely  the  summation  of  the  numbers 
in  the  C-groups  but  are  rather  the  total  counts  used  to  normalize  C- 
group  values.^"  That  is,  we  are  not  merely  spreading  the  data  in  the 
current  array  over  an  additional  dimension.  Figure  1 illustrates  this 
concept. 

To  minimize  the  increase  in  the  size  of  the  IPM,  the  maximum  num- 
ber of  C-groups  for  each  term  of  enlistment  is  limited  to  sixteen  each 
for  terms  3 and  4,  and  four  each  for  terms  2,  5,  and  6 (subject,  in 
total,  to  the  40  group  overall  limitation).  Over  the  short  term,  it 
is  planned  that  a c canon  set  of  eight  C-groups  will  be  defined  for  terms 
3 and  4,  and  that  terms  2,  5,  and  6 would  have  one  C-group  each. 


*The  totals  in  the  FTI  array  do  not  equal  the  sums  of  the  C-groups, 
because  some  observations  are  dropped  during  creation  of  the  C-group 
array. 
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CG  - Characteristic  Group;  h ” month  of  service; 
t ■ months  to  ETS;  FT  » First  Term;  RA  - Regular  Army;  FTI  • First  Timers 

Figure  1.  Relationships  Between  C-Group  and  FTI  Arrays 


Changes  to  ST I Array 

During  historical  tracking  and  inventory  projection,  populations 
remain  in  the  FTI  and  C-group  arrays  until  they  either  extend,  reen- 
list, separate,  or,  in  the  case  of  C-group  arrays,  have  served  55  months. 
The  STI  array  is  redefined  as  an  extendee-only  array.  It  holds  all 
extendees  until  they  either  reenlist  or  separate  or  have  served  55  months. 
Administrative  gains,  reservists  who  enter  active  duty,  prior  service 
gains  who  have  not  completed  a first  reenlistment,  and  other  miscella- 
neous gains  are  added  to  the  revised  "Reenlisted"  array  by  year  of  ser- 
vice. Enlistees  in  the  STI  array  are  classified  by  year  of  service 
and  months  to  ETS. 

Changes  to  Careerist  Array 

The  present  careerist  array  has  been  redefined  as  a reenlisted 
array.  Any  first-timer  or  extendee  who  reenlists  will  move  to  the  re- 
enlisted array.  Since  reenlistment  can  occur  before  the  36th  month 
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of  service,  Che  reenlisted  array  has  been  expanded  to  include  people 
with  fewer  than  three  years  of  service.  The  array  has  been  expanded 
to  present  data  by  individual  years  of  service  for  non-retirement  eli- 
gible personnel;  that  is,  the  length  of  service  dimension  is  increased 
from  two  classes  (NRE,  RE)  to  as  many  as  21  classes  (1,2, ...20,  21+). 
The  time-to-ETS  dimension  will  not  be  changed.  The  reenlisted  array 
is  intended  to  contain  every  person  who  is  not  either  a First-Timer 
or  a First-Timer  on  an  extension. 

Projection  of  Rates 

The  IPM/QIPM  applies  loss  rates  to  retention  rates  as  well  as 
to  populations.  Retention  rates  are  used  to  calculate  NPS  gains  in 
COMPLIP.  When  people  are  projected  to  extend  or  reenlist,  their  num- 
bers can  be  added  to  the  extendee  or  reenlisted  array.  However,  when 
extensions  and  reenlistments  occur  which  are  represented  by  retention 
rates  (with  actual  levels  determined  by  COMPLIP)  they  must  be  tracked 
in  a separate  array,  called  "extensions  and  reenlistments  of  retention 
rates."  The  portion  of  the  array  for  3 and  4 year  initial  terms  of 
enlistment  is  represented  in  Figure  2.  Unlike  the  numbers  representing 
people,  retention  rates  cannot  be  added  to  the  reenlisted  array. 

The  array  in  Figure  2 is  large:  there  will  be  a maximum  of  40 

C-groups  and  7 years  for  each  t.  In  addition,  at  least  89  classes  in 

the  month-of-accession  dimension  will  be  required.  As  the  projection 

proceeds  past  month  tt(2),  the  first  month  for  which  COMPLIP  determines 

FT  RA  accesions,  the  array  will  begin  to  fill  with  retention  rates, 

r.  .,  for  the  C-groups /terms  j ■ 1, ...n.  The  non-zero-filled  rows 
if  J 

will  run  from  ff(2)  to  i,  the  current  projection  month, 
implementation,  this  will  be  a 16  by  7 by  89  array. 
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Figure  2.  Extension  Retention  Rate  Array 
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Modifications  to  Other  Modules 

The  routines  in  DPM/QDPM  which  establish  historical  inventories 
for  arrays  not  classified  by  C-group — ADS,  FTI,  STI,  CAR,  and  the  three 
area  arrays— will  be  changed  very  little.  They  will  be  modified  to 
assign  people  to  the  appropriate  arrays  based  to  the  criteria  discussed 


Modifications  Affecting  COMPLIP 

Since  C-groups  will  include  term  of  service,  40  sets  of  reten- 
tion rates  might  have  to  be  input  to  COMPLIP  to  compute  NPS  gains  for 
each  C-group.  Although  technically  feasible,  this  would  substantially 
increase  COMPLIP's  run  time.  To  avoid  such  a large  increase  in  the 
size  of  the  linear  program,  the  C-groups  for  terms  3 and  4 are  sunned 
across  term  in  the  IPM  as  are  the  C-groups  for  terms  2,  5,  and  6.  Fac- 
tors are  provided  by  QFAC  to  spread  NPS  accession  reguirements  by  C- 
group  back  to  the  specific  terms  of  enlistment  in  the  Report  Generator. 
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APPENDIX  A 

THE  ELIM-IV  QUALITATIVE  DATA  BASE 


INTRODUCTION 


This  appendix  discusses  Che  enhancement  Co  Che  data  bases  which  will 
be  used  Co  provide  C-group  expansion  and  Cracking,  including  extendee  crack- 
ing. This  would  be  added  Co  those  of  ELIM-II  and  ELIM-III  to  accomplish 
Che  following  casks: 


• Expand  C-groups  from  4 to  40,  with  term  of  service  as  a C-group 
characteristic 

• Maintain  data  by  C-group  for  a longer  period  of  service 

• Track  enlisted  personnel  until  separation  or  first  reenlistment, 
explicitly  separating  first  from  career  reenlistments  (relates 
to  "first  enlistment  cohort  tracking"  and  "first-timer  to 
reenlistee  transition"  in  proposal) 

• Simplify  redefinition  of  C-groups  and 

• Enable  user  to  calculate  historical  loss  rates  and  current 
inventories  by  marital  status 


It  is  also  designed  to  facilitate  projecting  C-group  transitions,  if  that 
task  is  undertaken  in  the  future. 


A tracking  file  has  been  created  with  one  record  for  each  individual 
in  the  data  base.  The  record  indicates  his/her  entry  and,  where  poten- 
tially different,  current  characteristics,  and  the  month  and  type  of  each 
transaction  relevant  to  ELIM-COMPLIP.  The  small  numeric  data  base  has 
been  expanded  to  40  C-groups;  each  month  it  can  be  updated  directly  or  it 
can  be  re-created  from  the  new  tracking  file.  There  is  no  more  need  for  an 
intermediate  large  numeric  qualitative  data  base.  The  system  allows  for  an 
individual  to  change  C-groups.  Persons  would  be  counted  in  C-groups  until 
discharge. 

DESCRIPTION  OF  THE  DATA  BASE 

The  new  data  base,  the  tracking  file,  exists  in  two  forms:  the  so- 
called  large  (Table  A. 1)  and  small  (Table  A. 2)  tracking  files.  They  replace 
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TABLE  A.l 

LARGE  TRACKING  FILE 


1-9 

Social  Security  Number 

10 

NPS  Indicator 

11-16 

Date  of  Birth 

17-18 

AFQT 

19 

Sex 

20 

Race 

21-24 

Term  of  Service 

22-25 

BPED 

26 

Education  (Initial) 

27 

Mental  Category 

28-31 

Enlistment  Options 

32-33 

CMF 

34 

Pay  Grade  (Initial) 

35 

Eligibility  to  Reenlist 

36 

Moral  Waiver 

37 

Character  of  Separation 

38 

Prior  Service  Indicator 

39-42 

ETS 

43-45 

UIC 

46 

Bonus 

47 

Times  Reenlisted 

48-50 

Age  at  Entry,  months 

51 

Ethnic 

52 

Receiving  Station  (no  good) 

53-56 

BASD 

57 

Marital  Status 

58 

Number  Dependents 

59-60 

Initial  Assignment 

61-63 

Initial  Location 

64-66 

Initial  Primary  MOS 

67 

Current  Pay  Grade 

68-69 

Current  Assignment 

70-72 

Current  Location 

73 

Current  Education 

74-76 

Current  Duty  MOS 

77-79 

Current  Primary  MOS 

80-81 

Current  Status  Code 

82-83 

No.  of  Trailers 

T*1 

. . . T*24  x 

84 

268  Type  Transaction 

85-88 

269-272  Date  - 

89-91 

273-275  Transaction  Code 

1 Gain-Loss 

2 Pay  Grade 

3 Assignment 

4 Location 


5 Education 

6 Duty  MOS 

7 Primary  MOS 

8 Status  Code 


For  EMF  variables  (type  transaction  2-8),  new 
value  of  variable;  for  GLF,  3 character  SPN  or 
SPD  if  loss;  EBE,  etc.  if  extension;  GRF,  GHI, 
etc. , for  gains. 
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TABLE  A. 2 

SMALL  TRACKING  FILE 


1-4 

Cohort 

10-13 

Social  Security  Number 

14-15 

AFQT 

16 

Race 

17 

Sex 

18 

Term  Of  Service 

19 

Civilian  Education 

20 

Mental  Category 

23-25 

Age  at  Entry,  months 

21-22 

Q(F 

26-29 

BP  ED 

30-33 

ETS 

34-36 

UIC 

37 

(Recv) 

38-41 

BASD 

42-43 

Current  Assignment 

44-46 

Current  Duty  MOS 

47-49 

Current  Primary  MOS 

50-54 

Flags 

Initial  EMF  data  present 

Trailers  lost  (overflow) 

Extension 

Loss 

Return 

55-56 

Number  of  Trailers 

57-128 

Trailers:  . 

*1 

57  Type1  123 

Type 

58-59  Month  Service  124-25 

Month  Service 

60-62  Transaction  126-28 

Transaction 

1 


currently,  always 
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MI63,  MT67,  and  MLo8.  For  each  Individual  entering  the  Army  after  1 Jan  72, 
there  Is  one  record  (on  each  version  of  the  tracking  file).  This  record  con- 
sists of  the  person's  Social  Security  Administration  Number  (or  Temporary 
Identification  Number),  codes  indicating  characteristics  at  enlistment,  codes 
for  the  current  value  of  those  characteristics  which  can  change,  and  a series 
of  transactions  "trailers,"  one  for  each  transaction  relevant  to  ELIM- 
COMPLIP.  Each  trailer  contains  the  month  the  transaction  was  recorded  and 
the  type  of  transaction.  The  large  tracking  file  contains  slightly  more 


information  on  characteristics  than  the  small.  It  allows  for  more  trans- 
action trailers  to  accumulate  and  keeps  trailers  for  changes  in  character- 
istics as  well  as  for  gain  and  loss  transactions;  the  small  tracking  file 
contains  only  gain-loss  transactions.  The  large  file  records  SPN,  SPD, 
or  other  actual  code;  the  small  file  aggregates  into  16  categories.  The 
small  file  revises  ETS  date  when  an  extension  or  non-NPS  gain  occurs.  The 
large  file  is  in  a sense  an  archive,  from  which  status  of  Army  personnel 
at  any  time  in  the  past  can  be  reconstructed.  The  small  file  represents 
current  Army  personnel  status  and  is  designed  for  efficient  use  in  calculat- 
ing loss  rates  by  current  or  entry  characteristics.  Therefore,  the  large 
tracking  file  is  ordered  by  SSAN,  but  the  small  tracking  file  is  organized 
by  cohort. 

In  each  tracking  file,  following  the  SSAN  (or  TIN),  there  is  informa- 
tion on  the  individual's  race,  sex,  civilian  education,  etc.,  at  the  time 
of  entering  the  Army.  In  addition,  there  are  codes  for  current  character- 
istics, since  some  characteristics  which  the  Army  is  presently  interested 
in  can  change.*-  The  user  can  determine  from  the  file  each  individual's 
term  of  service,  time  to  ETS  (including  negative  t) , and  extendee  status- 
all  especially  important  for  the  desired  first  reenlistment  statistics. 

INTEGRATING  THE  DATA  BASE  INTO  ELIM-II  AND  ELIM-III 

Some  parts  of  the  ELIM-III  system  will  continue  to  operate  very  much 
as  they  have  in  the  past.  The  main  difference  is  that,  as  each  month's 
transaction  update  is  performed,  there  is  no  need  to  update  MT67B,  the 
large  numeric  qualitative  data  base.  This  data  base  currently  contains 
historical  statistics  for  150  quality  classes  by  month  of  service.  To 
provide  the  desired  flexibility  for  user  redefinition  of  classes,  this 
data  array  would  have  become  excessively  large  (>10,000  classes).  There- 
fore, it  was  replaced  with  the  tracking  file,  the  compressed  version  of 
the  Master  Transaction  Data  Base  MT63.  Each  month,  transaction  trailers 
are  added  to  records  in  the  tracking  file,  gains  are  added  to  it,  and 
current  characteristic  codes  are  changed  if  necessary  to  reflect  trans- 
actions such  as  a change  in  marital  status  or  number  of  dependents  or 


Both  entry  and  current  values  are  maintained  to  provide  the  user  the 
option  to  sort  on  status  at  either  time. 


education  level.  Then  a 40*-class  small  numeric  qualitative  data 
base  replacing  MT68  is  updated  directly  each  month.  When  the  user 
wants  to  redefine  quality  classes,  he  can  go  back  to  the  tracking  file 
and  recreate  the  smaller  data  base.  Procedures  for  updating  the  smaller 
data  base  are  discussed  in  more  detail  in  the  next  section,  follow- 
ing the  description  of  the  data  base. 

MODIFICATIONS  TO  MT68 

The  "small"  numeric  quality  data  base  consists  of  two  parts.  The 
main  part  stores  data  for  first-timers;  a second  part  contains  data  for 
extendees.  Conceptually,  the  first-timers  array  of  the  small  numeric 
quality  data  base  resembles  Figure  A.l.  The  shaded  box  represents  the 
data — extensions,  IMRE  losses,  etc. — for  cohort  1,  C-group  1,  in  the  first 
MServ.  It  currently  consists  of  11  items  of  information  but  might  be 
expanded  to  13  (see  the  next  section,  "Changing  an  Individual's  C-Group") . 
There  are  similar  sets  of  data  for  every  cohort,  C-group,  and  MServ  if 
the  cohort  is  old  enough  to  have  served  that  number  of  months. 


Figure  A.l.  Expanded  MT68 


Maximum . 
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All  members  of  a given  C-group  have  the  same  term  of  service,  h. 
Although  time  to  ETS,  t,  is  not  explicit  in  the  data,  it  is  completely  de- 
fined by  h and  MServ.  For  the  older  (earlier)  cohorts,  there  are  non-zero 
data  not  only  in  MServ  cells  1 through  12* h,  but  also  in  cells  with  MServ 
>12 *h,  for  individuals  with  t<0. 

The  second-timer  array  of  the  small  numeric  data  base  will  carry 
second-timers  through  loss  from  the  Army  or  for  54  months.  The  data  in 
it  will  be  used  for  producing  historical  data,  not  for  inventory  projec- 
tions. 


One  of  the  characteristic  codes  in  the  tracking  file  indicates  whether 
the  individual  is  a first-timer,  an  extendee,  a reenlistee,  or  a previous 
loss.  During  each  monthly  update,  as  each  transaction  is  added  to  the 
tracking  file,  this  characteristic  code  is  checked  to  determine  the  rele- 
vant array  for  the  affected  individual.  If  he/she  is  a first-timer,  the 
appropriate  C-group-MServ-cohort  in  the  first-timer  array  of  the  small 
numeric  data  base  is  manipulated.  In  addition,  if  the  transaction  is  an 
extension,  reenlistment,  or  loss,  the  individual  is  added  to  the  extendee 
or  "careerist”  array,  if  appropriate,  and  the  characteristic  code  for  the 
storage  array  is  updated.  The  "careerist"  array  becomes  a reenlisted  array. 
Because  an  individual  can  reenlist  before  completing  36  months  of  service, 
this  array  is  modified  to  cover  MServ  <37.  In  addition,  the  ELIM-III  DPM 
is  modified  so  that  individuals  are  assigned  to  this  array  only  when  they 
reenlist.  Currently  an  individual  can  be  assigned  to  the  careerist  array 
if  he/she  is  an  extendee  with  MServ  >36;  if  such  a person  subsequently 
reenlists,  his/her  first  reenlistment  appears  to  be  a career  reenlistment. 


If  the  transaction  affects  an  individual  in  the  extendee  array,  the 
appropriate  cell  in  that  array  is  updated  and  the  other  procedures  described 
for  first-timers  would  be  executed.  If  the  affected  Individual  is  a re- 
enlistee, ELIM-II  is  already  accounting  for  him/her  and  the  only  action 
taken  will  depend  on  the  type  of  loss  and  will  parallel  procedures  in  the 
current  system. 


A- 6 


With  separate  data  arrays  for  first-timers,  extended  first-timers, 
and  reenlisted  personnel,  it  is  possible  to  calculate  reenlistment  rates 
separately  for  first  and  for  subsequent  reenlistments.  This  is  not  pos- 
sible in  ELIM-III . 


Since  first  timers  are  to  be  carried  by  cohort,  no  explicit  monthly 
aging  will  be  needed;  it  will  be  automatic.  Extendees  will  be  carried  by 
months  to  ETS,  which  must  be  reduced  by  1 each  month  (unless  a new  ex- 
tension is  executed) . 

Each  person  remains  in  the  small  numeric  data  base  for  first  timers 
until  the  earliest  of  (a)  separation  from  service  (loss) ; (b)  IMmediate 
REenlistment  loss;  (c)  extension;  or  (d)  completion  of  54  months  of  service. 
The  54-month  cutoff  allows  for  reenlistment  of  all  persons  with  4-year 
first  enlistments  and  up  to  a 6-month  extension.  Extendees  are  carried 
until  they  reenlist  or  separate.  Reenlistees  are  carried  in  a careerist- 
type  array. 

The  small  numeric  data  base  (C-group  arrays)  is  saved  each  month  (as 
MT68B  is  now)  and  used  as  input  (MT68A)  the  next  month.  In  fact,  if  the 
user  is  experimenting  with  various  sets  of  C-groups,  several  versions  of 
the  C-group  arrays  can  be  saved  until  a final  determination  as  to  the 
desired  characteristics  is  made. 

CHANGING  AN  INDIVIDUAL'S  C-GROUP1 

In  each  monthly  update  of  the  data,  individuals  can  be  classified  on 
the  basis  of  the  current,  not  entry,  characteristics.  Thus,  an  individual 
need  not  be  in  the  same  C-group  every  month.  For  example,  if  an  individual 
were  single  from  MServ  ■ 1 until  a marriage  in  MServ  • 28,  he/she  would  be 
included  in  the  inventory  of  a C-group  for  single  . . . persons  with  each 
monthly  update  until  MServ  28  and  thereafter  in  a different  C-group,  one 
for  married  ...  persons  (with  "..."  signifying  otherwise  similar  character- 
istics) . 

*This  capability  is  not  provided  under  the  current  contract;  but  it  will 
be  possible  to  include  it  in  ELIM-IV,  with  a modest  additional  effort. 


This  necessitates  the  introduction  of  an  additional  loss  and 


an  additional  gain  category,  thus  expanding  the  data  in  MT68  from  11 
to  13  words  per  cohort,  C-group,  and  MServ,  as  noted  above.  That  is, 
each  cell,  as  represented  by  the  shaded  area  in  Figure  A.l,  would  be 
expanded  to  make  room  for  a ninth  loss  category,  transition  to  a differ- 
ent C-group,  and  a new  gain  category,  transition  from  a different  C- 
group.  If  changes  in  marital  status  were  the  only  transitions  to  be 
considered,  C-groups  for  single  persons  could  have  losses  due  to  marriage 
and  gains  due  to  divorce  (if  divorced  and  single  status  are  deemed 
equivalent) ; C-groups  for  married  persons  could  have  transitional  gains 
due  to  marriage  and  losses  due  to  divorce. 

Source  and  destination  C-groups  would  not  be  linked;  that  is,  an 
individual  could  move  from  a C-group  to  any  other  C-group.  There  would 
not  be  a set  of  C-groups  for  single  individuals  and  a parallel  set  for 
married  individuals  with  one-to-one  links  between  them. 

When  the  data  base  is  updated,  the  presence  of  a transaction  in- 
volving a C-group  transition  will  produce  a loss  in  the  individual’s 
beginning-of-month  C-group  and  a gain  in  his/her  end-of-month  C-group. 

If  a set  of  redefined  C-groups  is  to  be  created  anew,  routines  will 
have  to  be  devised  that  keep  track  of  C-group  for  the  MServ  being  dealt 
with  at  that  stage  in  the  data  creation.  This  C-group  need  not  necessar- 
ily be  the  same  as  the  entry  or  the  current — i.e.,  most  recent — C-group. 

INVENTORY  PROJECTIONS 

Although  the  historical  data  base  could  accommodate  changes  in 
C-group,  the  Inventory  Projection  Module  could  not,  without  great  effort, 
be  altered  to  allow  for  changes  in  C-group  in  the  projection  out-months. 
Ideally,  and  eventually,  before  loss  rates  are  applied  to  the  beginning 
inventory  for  each  out-month,  transition  loss  and  gain  (marriage  and 
divorce)  rates  would  be  used  to  reapportion  the  inventory  among  the  C- 
groups. 

On  the  other  hand,  if  a characteristic  which  can  change  values, 
such  as  marital  status,  is  used  for  C-group  assignment  and  projections 
are  made  without  C-group  transitions  in  the  out-months,  then  projections 


r 

in  the  out-months  are  likely  to  become  increasingly  inaccurate,  the 

farther  into  the  future  we  project.  This  inaccuracy  is  expected  because, 

i 

as  the  inventory  is  aged,  more  and  more  persons  will  actually  be  married, 
but  the  projection  system  will  not  move  individuals  between  single  and 
married  C-groups.  Most  NPS  gain  inputs  to  MServ  = 1 will  be  single;  and 
as  they  are  aged,  the  projected  inventory  will  have  an  unrealistically 
high  percentage  of  single  people. 

The  modifications  to  ELIM  greatly  expand  the  number  of  C-groups  in 
the  "small"  numeric  data  base.  The  larger  numeric  data  base  has  been  re- 
placed by  a tracking  file  with  a record  for  each  individual.  The  record 
contains  information  for  C-group  assignment  and  transaction  information. 

• The  data  enable  cohort  tracking  by  C-group  through  first  reenlistment, 

permit  simplified  C-group  redefinition,  and  carry  information  on  charac- 
teristics such  as  marital  status  that  were  not  previously  used  for  C-group 
classification. 
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APPENDIX  B 

CAREER  FORCE  OPTIMIZATION  (CFO) 

STATEMENT  OF  THE  PROBLEM 

The  Army  desires  the  capability  to  optimize  the  career  strength  over 
a specified  year  of  service  (YOS)  distribution  as  defined  by  the  Enlisted 
Force  Master  Plan.  Gains  to  the  career  force  come  from  several  different 
sources , including : 

• First  timers  with  36  months  of  completed  service 

• Reenlistees  of  several  types 

• Army  and  non-Army  prior  service  enlistees  with  36  months 
or  more  of  prior  service 

Not  all  career  gains  will  have  the  same  number  of  years  of  service,  there- 
fore, it  is  desirable  that  COMPLIP  be  able  to  optimize  the  gains  from  each 
category,  within  user  supplied  constraints,  such  that  the  Career  Force  YOS 
distribution  is  driven  towards,  and  ultimately  reaches,  the  desired  distri- 
bution within  a user-specified  length  of  time. 

Currently,  the  Army  uses  a 5-year  planning  cycle  for  setting  and 
working  towards  a desired  YOS  distribution.  Within  the  plan,  it  attempts 
to  optimize  the  force  within  the  next  3 years.  The  current  plan  specifies 
an  optimum  force  strength  to  be  reached  in  FY  80.  These  are  annual  strengths 
which  are  determined  as  of  30  September  of  each  fiscal  year.  Because  the 
Army  plan  extends  for  a maximum  of  5 years,  there  is  no  need  to  change 
ELIM-COMPLIP  in  terms  of  the  number  of  years  that  can  be  forecast. 

REQUIRED  SYSTEM  CAPABILITY 

Currently,  the  ELIM-COMPLIP  system  handles  monthly  data  and  distrib- 
utes the  career  force  into  two  categories:  retirement  eligible  (RE)  and 
non-retirement  eligible  (NRE) . To  optimize  a YOS  distribution,  it  is 

necessary  to  compute  strength  by  YOS  (1,2,3 20,20f).  In  order  to 

maintain  this  distribution  over  the  period  covered  by  the  manpower  program, 
factors  are  needed  to  age  the  appropriate  fraction  of  the  population  month 
by  month  from  one  YOS  to  the  next. 


The  analytical  problem  becomes  one  of  minimizing  the  weighted  sum  of 
the  deviations  of  the  actual  from  the  planned  career  force  strength,  by  YOS, 
subject  to  constraints  on  total  strength,  the  supply  of  NPS  enlistees,  the 
supply  of  PS  enlistees  by  YOS,  and  the  first-term  reenlistment  rates  by 
years  of  service.  (The  current  Army  policy  is  to  exercise  control  only  at 
the  first  reenlistment.  Beyond  the  first  reenlistment,  the  Amy  generally 
keeps  all  people  who  desire  to  reenlist.  However,  the  system  will  have  the 
capability,  through  user  input,  to  override  later  reenlistment  rates  so  that 
the  user  may  experiment  with  alternative  reenlistment  policies.) 

METHODOLOGY 

The  development  of  the  CFO  methodology  was  separated  into  two  phases. 
In  Phase  I,  the  YOS  distribution  for  the  career  force  was  developed  and  in 
Phase  II,  the  harder  problem  of  optimizing  the  various  gains  to  the  career 
force  to  meet  a specified  career  force  profile  has  been  attacked. 

Phase  I 

The  methodology  for  this  phase  involved  changes  to  the  DPM,  FAC,  IPM, 
COMPLIP  equations,  and  the  report  generator  to  produce  a YOS  distribution 
of  enlisted  strength.  This  has  been  discussed  in  previous  concept  papers. 

Phase  II 

Solution  of  this  phase  represents  one  of  the  most  difficult  tasks  of 
any  in  the  history  of  ELIM-COMPLIP  development.  The  following  sections 
discuss  the  alternatives  considered  and  the  alternative  used  for  solution 
of  the  task. 

The  Objective  Function 

From  the  preceding  discussion,  it  is  clear  that  the  objective  func- 
tion should  model  the  Amy’s  goal  of  reducing  the  deviation  between  the 
career  force  strength*"  and  a target  profile.  Specifically,  minimize 

^Although  the  term  "career  force  strength"  will  be  used  for  clarity  through- 
out this  paper,  the  optimization  will  minimize  the  weighted  difference  be- 
tween actual  and  target  strength  for  YOS  1,  2,  and  3 as  well  as  the  career 
force. 
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the  weighted  sum  of  the  absolute  values  of  the  differences  between  pro- 
jected year -end  trained  strength  of  the  career  force  by  years  of  service 
and  target  values  (Career  Force  Profile)  In  the  target  year.  It  is  not 
necessary  to  have  additional  profiles  in  each  of  the  years  preceding  the 
target  year  since  the  target  profile  at  the  end  of  year  .OX  will  drive 
all  accessions  that  will  end  up  in  that  profile.  Target  profiles  may 
not  be  needed  before  the  target  year  but  may  be  desirable.  Profiles 
after  the  target  year  are  certainly  needed. 

An  alternative  to  making  the  user  create  a target  profile  for 
each  year  of  projection  is  to  use  the  same  profile  as  in  the  target 
year  but  relax  the  acceptable  deviations  more,  the  further  away  the 
projection  year  is  from  the  target  year.  That  is,  if 

FCZ  ■ fraction  of  the  target  that  represents  acceptable 
deviations 

RCF  - factor  for  adjusting  FCZ  for  other  than  target  years,  RCF  > l 

NYR  » target  year 

IYR  " projection  year 

then,  define  FC(IYR)  as 

IYR-NYR 

FC(IYR)  - FCZ  * RCF 


Note  that  FC(IYR)  - FCZ  when  IYR  - NYR 


FCTARiJ j - FC(IYR)  * CTARGnjj  ; i - IYR,  n - NYR 


where  CTARGnjj  is  the  target  profile  in  the  target  year  n and  FCTARijj 
represents  the  acceptable  deviations  in  projection  year  i. 
Graphically,  FC(IYR)  is  as  shown  in  Figure  B.l. 


. 
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Figure  B.l.  Graph  of  FC(IYR) 

The  user  would  provide  the  desired  force  distribution,  the  year  by 
which  it  is  to  be  met,  the  weights  for  the  difference  between  YOS  strength 
and  target  for  each  YOS,  and  the  factor  which  moves  the  force  towards  that 
target.  The  formulation  for  the  objective  function  is  presented  in 
Annex  1 of  this  appendix. 

The  concept  for  the  objective  function  follows  from  the  problem  state- 
ment; however,  the  concepts  for  varying  reenlistment  gains  are  not  as 
straightforward . 

Because  most  of  the  gains  to  the  career  force  enter  with  3 and  U years 
of  service,  there  is  very  little  that  can  be  done  in  the  short  term  to 
adjust  current  deviations  beyond  YOS  10.  However,  this  is  not  a significant 
problem,  for  two  reasons:  (1)  approximately  two-thirds  of  the  absolute 
value  of  the  deviations  from  the  current  force  profile  occur  within  YOS  1 


through  10,  with  the  greatest  deviations  being  in  YOS  4 through  7,  and 
(2)  beyond  YOS  10,  the  retention  rates  are  quite  steady.  This  means  that 
if  the  curve  for  YOS  1 through  10  can  be  adjusted  in  the  short-term,  the 
corrections  to  the  rest  of  the  curve  should  come  automatically  with  time. 

Varying  First  Reenlistments 

To  model  a CFO  decision  process,  first  reenlistments  must  be  allowed 
(within  specified  limits)  to  vary  independently  of  NPS  gains.  However, 
the  number  of  first  reenlistments  by  a given  cohort  is  actually  the  product 
of  factors  representing  attrition  and  reenlistment  rates  and  a variable 
representing  the  number  of  NPS  gains  for  that  cohort.  If  the  reenlistment 
rate  is  replaced  by  a variable,  then  the  number  of  reenlistments  becomes 
the  product  of  two  variables — a nonlinear  function  which  is  nonconvex. 
Nonconvex  functions  generally  preclude  both  linear  programming  and  the 
efficient  solution  methods  for  convex  nonlinear  problems.  Conceptually, 
they  are  in  a category  with  the  integer  problems  which  lack  a network 
structure;  problems  of  this  type  with  more  than  six  to  ten  variables  are 
often  considered  "large"  and  may  require  several  hours  or  more  to  solve. 
Therefore,  it  is  clear  that  it  is  not  reasonable  simply  to  add  additional 
variables,  representing  reenlistment  factors,  to  the  present  COMPLIP  con- 
straint set. 

Since  the  idea  of  replacing  the  linear  program  in  COMPLIP  by  a non- 
linear program  is  infeasible,  other  alternatives  must  focus  on  the  treat- 
ment of  the  terms  representing  reenlistment  gains.  (The  term  "reenlistment" 
in  subsequent  discussions  will  always  imply  "first  reenlistments.")  Non- 
linearity can  be  avoided  by  modeling  reenlistment  losses  as  they  currently 
are,  that  is,  as  linear  functions  of  NPS  gains. 

Iterating  Between  Successive  LPs 

One  approach  is  to  develop  a linear  program  which  would  only  vary 
the  reenlistment  factors,  with  NPS  gains  being  fixed.  In  operation, 

COMPLIP  would,  using  the  current  LP,  find  a solution  vector  of  NPS  gains 
which  optimized  the  CFO  objective  function  with  reenlistment  rates  fixed; 
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then  the  same  objective  function  would  be  solved  by  another  LP  formulation 
which  holds  NPS  gains  fixed  at  the  previous  "optimal"  value  and  solves  for 
a factor  of  reenlistment  factors.  These  factors  would  become  constants  in 
the  "original"  LP  and  it  would  be  solved  again  for  a new  set  of  NPS  gains. 
The  iteration  would  continue  until  there  was  negligible  improvement  in  the 
objective  function. 

This  technique  avoids  dealing  with  nonlinear  programs  but  the  itera- 
tive process  raises  other  problems.  There  is  no  way  of  knowing  how  many 
iterations  would  be  required  for  convergence,  or  if  the  process  would 
converge  at  all.  Convergence  characteristics  may  be  a function  of  weight- 
ing factors  in  the  objective  function  or  combinations  of  values  for  the 
technology  coefficients.  Quite  possibly  the  operator  would  have  no  method 
of  forecasting  convergence.  Other  difficulties  include  the  mechanics  of 
communication  between  the  two  LPs  and  the  increased  maintenance  load  which 
such  a complex  routine  could  place  on  support  personnel.  A solution 
scheme  not  requiring  iterations  would  be  more  desirable. 


Separating  Reenlistment  Gains  and  Losses 

Another  method  to  avoid  nonlinearity  is  to  disconnect  the  computa- 
tion of  reenlistment  gains  and  losses.  Rcaulistment  losses  would  remain 
functions  of  NPS  gains  and  the  appropriate  factors;  however,  reenlistment 
gains  would  become  a new  COMPLIP  variable  which  would  be  upper  and  lower 
bounded  by  a user-supplied  linear  function  of  reenlistment  losses.  In 
effect,  this  would  permit  the  first  termers  and  career  force  to  vary 
independently  (within  reasonable  limits).  The  basic  COMPLIP  structure 
could  be  retained  as  all  variables  would  be  linear  and  solution  iterations 
would  not  be  required.  COMPLIP  would  not  be  modified  other  than  those 
changes  necessary  to  add  additional  variables  and  constraint  equations 
and  an  additional  objective  function. 

Accounting  for  Career  Force  Losses 

Another  problem  to  deal  with  is  the  computation  of  losses  to  the 
populations  that  have  reenlisted.  Heretofore,  the  post-reenlistment 
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losses,  and  thus  the  contribution  to  the  inventory  of  those  who  have  re- 
enlisted, were  modeled  in  the  IPM  by  extension/reenlistment  retention 
rates.  Ther.e  rates  were  passed  to  COMPLIP  and  became  technology  coef- 
ficients representing  the  reenlisted  population  as  a proportion  of  NPS 
gains.  This  relationship  would  not  be  maintained  under  the  above  scheme 
since  the  "optimal"  reenlistment  factors  would  not  be  known  until  comple- 
tion of  the  COMPLIP  run. 

One  approach  would  be  to  create  an  additional  set  of  rates  for  the 
IPM  to  represent  the  losses  to  the  reenlisted  population.  COMPLIP  does 
not  require  career  losses  by  type  of  loss,  it  simply  needs  to  know  how 
many  post-reenlisted  are  present  in  the  inventory.  Given  the  number  of 
reenlisted  (actually  a variable  representing  reenlisted),  aggregate  factors 
could  be  used  to  model  the  attrition  of  the  reenlisted  in  COMPLIP.  This 
would  be  accurate  enough  for  COMPLIP' s requirements  but  would  not  provide 
information  on  the  various  categories  of  losses  for  the  Report  Generator. 

In  other  words,  once  the  correct  numbers  of  reenlistments  have  been  selected, 
there  must  be  a projection  of  losses  by  type. 

Losses  by  type  could  be  determined  by  constructing  this  capability 
in  the  Report  Generator.  Actually  this  would  require  building  a sort  of 
IPM.  It  would  seem  more  logical  to  use  the  loss  projection  capability 
that  already  exists — in  the  IPM.  This  suggests  that  after  COMPLIP  deter- 
mines the  optimum  reenlistment  rates  (as  well  as  NPS  gains,  etc.),  we 
return  to  the  IPM  to  allocate  the  career  losses.  Since  NPS  gains  are 
known,  the  IPM  would  produce  an  "all  people  run"  (that  is,  retention 
rates  would  not  be  used),  losses  would  be  correctly  allocated,  and  the 
results  would  proceed  to  the  Report  Generator.  To  format  these  results 
for  the  Report  Generator,  it  will  be  necessary  for  the  solution  to  pass 
via  the  COMPLIP  solution  file,  but  the  LP  solution  would  be  immediate. 

This  approach  solves  another  problem,  that  of  properly  allocating 
reenlistments.  Although  feasible,  it  would  appear  unwieldy  and  very 
burdensome  to  the  operators  for  COMPLIP  to  compute  different  reenlist- 
ment factors,  for  each  projection  month,  for  each  of  40  C- groups. 
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Since  the  Army  does  not  goal  reenlistments  by  C-groups,  and  the  cost  In 
computer  time  for  COMPLIP  to  model  variable  reenlistments  for  each  C- 
group  could  easily  add  several  hours  to  a COMPLIP  run,  we  propose  to  have 
COMPLIP  determine  an  aggregate  reenlistment  factor  for  all  C-groups  (and 
all  terms  of  service).  Then,  after  COMPLIP  is  solved,  but  before  return- 
ing to  the  IPM  for  the  second  pass,  a new  processor  would  allocate  the 
aggregate  reenlistment  rate  over  all  C-groups  in  proportion  to  the 
original  rates.  The  second  pass  to  the  IPM  would  then  allocate  correctly 
the  reenlistments  by  C-groups.  This  scheme  for  assigning  reenllstments 
to  C-groups  would  preserve  the  proportion  of  reenlistments  dictated  by 
reenllstment  losses  unless  the  user  desires  otherwise. 

Prior  service  gains  will  continue  to  be  treated  in  the  same  manner 
as  in  the  current  system. 

The  CFO  equations  and  additional  comments  on  the  methodology  are 
contained  in  the  annex  to  this  appendix. 


ANNEX  B1 

CFO  COMPLIP  EQUATIONS 

This  annex  contains  the  basic  COMPLIP 
equations  to  be  implemented  for  career 
force  optimization  (CFO). 


n 


n 
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COMPLIP  EQUATIONS  FOR  CAREER  FORCE  OPTIMIZATION 


OBJ. . .07  - objective  function 

Y NYS 

min.  Yj  E [w^„(jj)*ACPOSnjj-W  „(jj)*ACNEGnjj 
ttt  acp  acn 

n-l  jj“l 

+ Wexp ( j j ) *EXPOSnj  j+Wexn( j j ) *EXNEGnj  j] 

where , 

multiplication  is  denoted  by  * 
n is  the  target  year 

NYS  is  the  maximum  number  of  YOS  categories 

y is  the  number  of  years  in  manpower  program 

jj  is  the  year  of  service  (YOS)  index 

ACPOSnjj  is  the  variable  for  the  number  of  acceptable 
positive  deviations  from  target  in  year  n for  YOS  jj 

ACNEGnj j - acceptable  negative  deviations 

EXPOSnjj  - excessive  positive  deviations 

EXNEGnjj  - excessive  negative  deviations 

Wacp  ~ OBvr,,,07  weight  for  ACPOSnjj 

Wacn  ~ OBJ... 07  weight  for  ACNEGnj j 

W - OBJ... 07  weight  for  EXPOSnjj 
exp 

W - OBJ... 07  weight  for  EXNEGnjj 
exn 

E.CFDnj j - CF  deviations  from  target 

CF.Y.nj j - ACPOSn j j-EXPOSnj  j+ACNEGnj  j+EXNEGn j j = CTARGnjj 

for  n-l,2,...,Y;  j j*I ,2, . . .NYS 
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where 


CF.Y.njj  - denotes  the  career  force  (CF)  strength  in  target 
year  n with  YOS  jj 

CTARGnjj  - denotes  the  CF  strength  in  year  n with  YOS  jj 
Note:  Currently  NYS  = 21  and  YOS  21  includes  YOS  > 20 

UCDACnll  - upper  limit  on  acceptable  deviations 

ACPOSnj  j+ACNEGnj  ,1  <.  FCTARnj  j 
for  n-1,2,... ,Y;  j j-1,2, . . . ,NYS 

where , 

FCTARnj j = FC(n)*CTARGnj j 

FC(n)  =■  user-supplied  factor  for  computing  acceptable  limit 
of  deviations  from  target 

C. OBJ. 07  - constraint  on  OBJ... 07 


Y NYS 

Z Z [w  (j j )*ACP0Snj j+W  (j j )*ACNEGnj j 
n-1  jj-1  acP  aCn 

+ Wexp(jj)*EXPOSnjj+Wexn(jj)+EXNEGnjj] 

where , 

■ slightly  larger  than  the  OBJ... 07  functional 

E . CFYnj j - CF  strength  in  target  year  n with  YOS  jj 

6 

CF.Y.njj  - 52  Z IRG.Ukk*n  (kk,12n) 

U-3  toP,.  IRG 

itjn 


. 


NCG34 

- £ . £ FTQCqqkk*n  (kk,12n) 

qq=l  kk6P£jn 

j^jj 

£=0 


- Z FTRTT.kk*n  (kk,12n) 
kk«P*jn 


j=jj 


l-U 


NCG256  

" -Z  Z FTQ2qqkk*7l  (kk,  12n)  = CFP.Ynjj 
qq=l  kk€P£jn 

Jl=0 

j=jj 


where , 

CF.Y.njj  * CF  strength  in  target  year  n with  YOS  jj. 

IRG.!l!lkk  « IRG  in  COMPLIP  month  kk  with  YOSO.  (e.g., 

££=04  for  months  of  service  37,38, ...  ,48) . 

FTQCqqkk  = FT  gains  for  CGqq  in  COMPLIP  month  k. 

HjRG(kk,12n)  = retention  rate  of  IRG  in  COMPLIP  month  of 
accession  kk  at  end  of  COMPLIP  month  12n. 

H (kk,12n)  * retention  rate  of  FT  terms  3 and  4 CG  qq 

qq 

gains  in  COMPLIP  month  kk  at  end  of  COMPLIP  month  L2n. 
n „(kk,12n)  * retention  rates  of  FT  terms  2,  5,  6 CGqq 

qqz 

gains  in  COMPLIP  month  kk  at  end  of  COMPLIP  month  12n. 

CFP.Ynjj  - CF  (people  not  retention  rates)  with  YOS  jj  at 
the  end  of  year  n. 

Pijn  "{  kk  I j-1<u+n-  1T r<j} 

The  retention  rates  nTTjr,  will  be  computed  in  the  IPM  for  YOS  3,  4, 

IKu 

5,  6 at  the  time  of  reenlistment.  These  retention  rates  are  applicable 
only  to  first  reenlistments.  Immediate  reenlistment  losses  will  be 

B-12 


computed  in  the  IPM  for  all  data  array  cells  for  which  there  are 
non-zero  immediate  reenlistment  loss  rates.  However,  immediate 
reenlistment  gains  will  be  added  only  for  those  corresponding  to 
second  or  later  reenlistments.  The  IRG  for  the  first  timer  reenlist- 
ments will  be  accounted  for  by  means  of  the  COMPLIP  variable 
IRG.Ukk. 

UIRG j j ii  - upper  constraint  on  first  IRG  in  COMPLIP  month  ii  with  YOSj  j 

NCG34 

IRG. j jii-UF  J]  53  FTQCqqkk*niRL(qq,kk,ii) 

qq=i  kk€Fkji 
JJ-j-3,4 
i=ii 
k=kk 


NCG256 

-UF  53  '53  FTQ2qqkk*Tl  (qq.kk.ii) 

i"-1  “«\,i 

jj=j=5 
i=ii 
k=kk 


<UF  * PIRLjjii 


LIGRjjii  - lower  constraint  on  first  IRG  in  COMPLIP  month  ii  with 


YOSjj. 


NCG34 


IRG.  j j ii-LF  53  51  FTQCqqkk*Tl  (qq.kk.ii) 


qq=*l  kk€F 


kj  i 
j j*j=3,4 
i=ii 
k=kk 


NCG256 

-LF  .5]  £ FTQ2qqkk*nlSL2  (qq.kk.ii) 

qq=l  kk«FkJi 

i=ii 

k-kk 


>LF  * PIRLjjii 
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where 


E.ENLSii 


UF  and  LF  are  user  supplied  factors  defining  the  first 
IRG  constraint  band  width  (e.g,,  UF=1.05,  LF=.95). 

niRl/qq,kk’ii)  = Fraction  oF  FT  f°r  terms  3 and  4 

with  month  of  accession  kk  who  are  IR  losses  in  COMPLIP 
month  ii. 

r'lRL2  (*W  »kk,ii)  = fraction  of  FT  CGqq  (for  terms  2,  5,  6) 
with  month  of  accession  kk  who  are  IR  losses  in  COMPLIP 
month  ii. 

PIRLiijj  represents  IRL  from  the  "people"  inventory  that 
are  in  the  first  reenlistment  window  in  COMPLIP  month 
ii  with  YOS  j j . 

- enlisted  strength  computation  ' 

ENLS . . ii  - | ENLS . . kk  - NPSG. . ii+AUSL. . ii  + FTL. . .ii 
kk=ii-l 


- EIRG- 

all  jj 


ii-1 

Jjii+  E 

kk=ir(2)-a+l 
all  jj 


fnIRG  (kk , ii-i)  -nIRG  (kk , ii )]  IRG . j j kk 


= NGLii 


for  ii  = Ag+1, . . . ,M+5. 
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Following  is  a simple  schematic  for  Career  Force  Optimization: 


[IPM  - CF0| 
run 


COMPLIP 
CFO  run3 


Compute  new  IR  loss 
rates,  fix  NPS  and  IR | 
gains  for  IPM  input 


COMPLIP  run 

for  reporting}^- 
b 

purposes 


IPM  - all  people | 
run  (non-CFO) 


*In  this  pass  COMPLIP  can  have  three  objective  functions  optimized 
in  sequence,  e.g., 

OBJ... 02  - operating  strength  optimization 
OBJ... 07  - career  force  optimization 
OBJ... 03  - REP  optimization 

"’in  this  pass  only  OBJ... 03  is  used. 


APPENDIX  C 

USER  MODIFICATIONS  OF  LOSS  FACTOR  PROJECTIONS  BY  OPERATIONS 
ON  LOSS  DATA  AGGREGATED  BY  ACCESSION  COHORT 
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BACKGROUND 

In  earlier  versions  of  ELIM,  inventories,  gains,  and  losses  are 
tracked  by  month  of  service  for  first-timers  through  their  first  21  months. 
Loss  rates  are  projected  by  extension  of  time  series  of  data  by  month  of 
service.  No  attempt  is  made  to  modify  loss  rates  in  a given  month  of  service 
due  to  changes  in  loss  rates  for  earlier  months  of  service  for  that  same 
accession  cohort.  This  factor  projection  and  modification  capability  is  in 
conflict  with  manpower  planning  procedures  and  OSD  guidance  where  acceptable 
attrition  is  now  being  defined  on  a cumulative  basis  by  accession  cohort. 


DATA  AVAILABILITY 

The  revised  ELIM-IV  qualitative  data  base  carries  transactions  out  to 
at  least  48  months  of  service  and  will  be  constructed  for  ease  and  efficiency 
of  retrieval.  All  the  historical  data  required  for  construction  of  cohort 
retention  curves  over  months  of  service  is  in  the  new  data  base. 

GENERAL  METHODOLOGY 

In  ELIM-III,  loss  rates  are  projected  and  displayed  as  a monthly  time 
series  for  a particular  month  of  service  (and  particular  C-group)  as  in 
Figure  C.l. 


Figure  C.L  Historical  and  Projected  Loss  Rates  for  Month 
of  Service  "X,"  C-Group  "Y, " and  Loss  Type  "Z" 
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Given  a complete  set  (all  months  of  service,  all  loss  types  for  a 
given  C-group)  of  data  like  Figure  C.l,  it  is  possible  to  create  the  cohort- 
oriented  plots  of  Figure  C.2.  While  the  transition  to  a plot  of  a single 
month  cohort  is  obvious,  it  is  also  probably  impracticable.  However,  given 
a set  of  monthly  factors  (great  accuracy  not  required)  for  accession  levels, 
these  data  can  be  easily  aggregated  up  to  annual  or  semiannual  cohorts. 


Figure  C.2.  Historical  and  Projected  Loss  Rates  for  Cohort  "X" 
and  C-Group  ”Y" 


These  plots  in  Figure  C.2  are  in  terms  that  are  more  useful  for  pro- 
jection purposes  when  losses  are  constrained  by  accession  cohort.  The 
Cohort  Targeting  System  provides  simple  user  modification  tools  for  these 
types  of  data.  The  user  can  (iteratively)  modify  either  or  both  forms. 

For  the  conditional  loss  rates  on  the  left,  the  user  specifies  the  month  of 
service  range  and  the  percent  reduction  in  the  loss  rate  (Figure  C.3).  For 
the  cumulative  loss  rates  on  the  right,  the  user  specifies  the  cumulative  rate 
at  an  end  point,  probably  36  or  48  months  of  service,  and  the  month  of 
service  range  over  which  required  changes  in  loss  rates  are  to  be  made  to 
achieve  the  designated  target  loss  rate  (Figure  C.4). 
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Figure  C.4.  Projected  Cumulative  Loss  Rate  with  Losses  in  Months 
of  Service  24-48  Reduced  to  Achieve  a Percentage 
Reduction  in  the  Cumulative  Loss  Rate  at 
48  Months  of  Service 
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Once  the  user  has  arrived  at  versions  of  the  loss  projections  by 
cohort  as  in  Figure  C.2,  these  modifications  are  disaggregated  back  to 
the  form  required  by  the  QIPM,  i,e.,  as  in  Figure  C.l. 

OPERATIONAL  CONSIDERATIONS 

For  several  reasons,  this  capability  exists  independent  of  the 
current  (or  planned)  production  flow.  That  is,  its  use  is  not  required 
for  successful  operation  of  ELIM-IV,  This  program  operates  interactively, 
so  that  the  user  can  conveniently  select  a cohort,  see  the  loss  patterns 
that  QFAC  is  passing  to  QINV,  and  iteratively  modify  the  cohort  loss  rates 
to  accomplish  a convergence  to  the  pattern  which  satisfies  all  the  users' 
constraints.  It  is  also  possible  to  get  hard  copy  outputs  so  that  the  user 
does  not  have  to  make  all  modifications  in  a single  pass. 

Nevertheless,  under  either  mode  of  operation  the  principal  considera- 
tion still  is  that  this  capability  has  been  constructed  as  an  optional  en- 
hancement to  the  normal  procedure,  that  the  input  from  this  program  uses  the 
basic  QFAC  projections,  and  that  the  output  is  in  the  form  of  revised  QFAC 
projections  (at  least  to  the  extent  that  the  user  wishes  to  modify  the  QFAC 
outputs  as  a result  of  his  cohort-based  loss  analysis) . 

AN  INITIAL  COMMAND  SET 

To  implement  this  system,  a program  has  been  written  to  accept  user 
commands  and  act  on  them.  The  specifications  for  the  primary  commands  are 
provided  in  the  following  pages. 
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COHORT  TARGETING  SYSTEM  COMMANDS 


COHORT  — SELECTS  USER-SPECIFIED  SECTION  OF  DATA 
MODIFY  — AFFORDS  USER  MEANS  BY  WHICH  TO  MANIPULATE  DATA 
DISPLAY  — PRODUCES  GRAPH<S>  OF  USER-SPECIFIED  SECTIONS  OF  DATA 
KEEP  — ALLOWS  USER  TO  STORE  DATA  FOR  FUTURE  RETRIEVAL 
LORD  — AFFORDS  USER  ACCESS  TO  PREVIOUSLY  STORED  DATA 
REPLACE  REPLACES  ORIGINAL  DATA  WITH  MODIFIED  DATA 

HARDCOPY  — ALLOWS  CREATION  OF  PRINT-FILE  CONTAINING  GRAPHS  AND  ALL 
ACCEPTED  COMMANDS * AND  USER-GENERATED  NOTES 

PARAMS  — PROVIDES  INFORMATION  PERTINENT  TO  CURRENT  COHORT 

SCALE  — ALLOWS  USER  TO  SPECIFY  FREQUENCY  OR  CUMULATIVE  SCALE  TO 
BE  USED  FOR  GRAPHING  PURPOSES 

TERMINATE  — CAUSES  CTS  SESSION  TO  END 

<HELP>  ? — PROVIDES  USER  WITH  INFORMATION  REGARDING  COMMAND 
FUNCTION*  SYNTAX  * AND  OPERANDS 

QUERY  — LISTS  KEEP-SETS  IN  USE  AND  THEIR  LABELS*  STATUS  OF 

HARDCOPY*  AND  CURRENT  VALUES  OF  CUMULATIVE  Sc  FREQUENCY 
SCALES 

NOTE  — ALLOWS  USER  TO  RECORD  NOTES  BOTH  ON  TERMINAL  AND  TO 
PRINT-FILE 

FREE  — RELEASES  USER-SPECIFIED  KEEP-SETS  FDR  RE-USE 
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COHORT 


0 


In  010 

01 

In  020 

01 

FUNCTION: 

THE  COHORT  COMMAND  ALLOWS  THE  USER  TO  SELECT  A 

01030 

01 

SPECIFIC  SECTION  OF  DATA  FOR  MODIFICATION  OR 

yn  040 

01 

EXAMINATION. 

(1050 

01 

1 06  0 

01 

SYNTAX: 

COHORT  'YYMM'  ('NMONO  TERM  C'Tl  ' > . . . 'T5'>  CGC'Gl' 

01070 

01 

"U030 

01 

ALIAS: 

C 

,no9o 

01 

01100 

01 

OPERANDS: 

'YYMM'  — THE  YEAR  AND  MONTH  OF  DATA  AT  WHICH  TO 

OHIO 

01 

1 1120 

01 

'NMON'  — AN  INTEGER  VALUE  SIGNIFYING  THE  NUMBER 

01130 

01 

• OF  MONTHS  OF  DATA  DESIRED. 

01140 

01 

• 1150 

01 

TERM  — ALIAS:  T 

•-1160 

01 

SPECIFIES  THAT  ONE  OR  MOPE  TERMS  ARE  DESIRED 

01170 

01 

'Tl'  — INTEGER  SIGNIFYING  A TERM. 

01130 

01 

• 

H190 

01 

CG  — SPECIFIES  THAT  ONE  OR  MORE  CHARACTERISTIC 

O1200 

01 

GROUPS  IS  DESIRED. 

01210 

01 

'G1 ' INTEGER  SIGNIFYING  A 

1220 

01 

CHARACTERISTIC  GROUP. 

1230 

01 

01240 

01 

NOTE: 

ALL  OPERANDS  ARE  REQUIRED. 

'H250 

01 

1260 

01 

.1270 

01 

01230 

01 

'1290 

01 

1300 

99 

MODIFY 


2010 

02 

2020 

02 

FUNCTION: 

THE  MODIFY  COMMAND  ENABLES  THE  USER  TO  MAKE  CHANGES 

02030 

02 

TO  DATA  AS  DESIRED. 

*2040 

02 

2050 

02 

SYNTAX: 

MODIFY  CUMULATIVE  MONTHS C'Ml '/'Ml 'M2'> 

*2060 

02 

— 

TARGET  < ' MTGT ' > ' TV AL ' > 

02070 

02 

2030 

02 

OR: 

MODIFY  FREQUENCY  MONTHS  <'M1  '/"Ml  ' j*  'M2'> 

2090 

02 

— 

PERCENT ✓CONSTANT < ' PCVAL' > 

02100 

02 

- 

LOSSTYPES  CALL/ ' TYFE' > ' TYPE' «* . . . ' TYPE' > 

*2110 

02 

2120 

02 

OPERANDS s 

CUMULATIVE  — ALIAS:  C 

v2 130 

02 

SPECIFIES  THAT  CUMULATIVE  DATA  IS  TO  BE  MODIFIED. 

02140 

02 

2150 

02 

FREQUENCY  — ALIAS:  F 

2160 

02 

SPECIFIES  THAT  FREQUENCY  DATA  IS  TO  BE  MODIFIED. 

02170 

02 

02130 

02 

MONTHS  — ALIAS:  M 

2190 

02 

SPECIFIES  THE  RANGE  OF  MONTHS  (INCLUSIVE} 

<••2200 

02 

OF  DATA  TO  BE  MODIFIED. 

02210 

02 

'Ml'  — INTEGER  SIGNIFYING  THE  FIRST  MONTH. 

2220 

02 

'M2'  — INTEGER  SIGNIFYING  THE  LAST  MONTH. 

2230 

02 

02240 

02 

TARGET  — ALIAS:  T 

02250 

02 

INDICATES  THAT  TARGET-MONTH  AND  TARGET-VALUE 

i 2260 

02 

FOLLOW  THIS  OPERAND. 

..2270 

02 

'MTGT'  — INTEGER  SIGNIFYING  THE  TARGET-MONTH. 

02230 

02 

'TVAL'  — REAL -NUMBER  SIGNIFYING  THE  TARGET-VALUE. 

i :290 

02 

DECIMAL  POINT  IS  REQUIRED. 

i .300 

02 

02310 

02 

PERCENT/ 

0*320 

02 

CONSTANT  — ALIAS:  P/C 

f.  330 

02 

SPECIFIES  THAT  MODIFICATION  OF  DATA  WILL  OCCUR 

0w340 

02 

BY  MEANS  OF  EITHER  A PERCENTAGE  OR  CONSTANT 

02350 

02 

VALUE t DEPENDING  UPON  WHICH  FORM  OF  THE  OPERAND 

C 360 

02 

IS  ENTERED » PERCENT  OR  CONSTANT  (P  OR  C> . 

C 370 

02 

'PCVAL'  — REAL -NUMBER  SIGNIFYING  THE  VALUE  OF  THE 

02330 

02 

PERCENT  DR  CONSTANT  EMPLOYED  TO  MODIFY 

02390 

02 

THE  DATA.  DECIMAL  POINT  IS  REQUIRED. 

€ 400 

02 

t-j41  0 

02 

LOSSTYPES  — ALIAS:  L 

02420 

02 

SPECIFIES  THAT  LOSS-TYPES  TO  BE  MODIFIED  FOLLOW 

O'*  430 

02 

THIS  OPERAND. 

f;  440 

02 

ALL  — SPECIFIES  THAT  ALL  LOSS-TYPES  ARE  TO  BE 

02450 

02 

MODIFIED. 

02460 

02 

'TYPE'  — AN  ALPHABETIC  ENTRY  SIGNIFYING  A LOSS-TYPE 

0 +70 

02 

TO  BE  MODIFIED!*  TAKEN  FROM  AMONG  THE 

0wr430 

02 

FOLLOWING: 

02490 

02 

ETS 

i>**5  0 0 

02 

IMRE 

0 '510 

02 

DFMC 

0cT520 

02 

TRDP  EITHER  FORM 

02530 

02 

TDP  ACCEPTABLE. 

0*  .5*0 

02 

MCDT 

0 .550 

02 

UNFT 

02560 

02 

PHYD 

0*570 

02 

UNKN 

0 530 

02 

OTHP 

0*536 

02 

ADMN 

1 1 
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MODIFY  (cont.) 


02600 

02 

02610 

02 

02620 

02 

02630 

02 

02640 

02 

02630 

02 

02660 

02 

02670 

02 

02680 

02 

02690 

02 

02700 

02 

02710 

02 

02720 

02 

02730 

02 

02740 

02 

02730 

02 

02760 

02 

02770 

02 

02730 

02 

02790 

02 

02800 

02 

0231  0 

02 

02320 

02 

02330 

02 

02840 

02 

02830 

02 

02860 

Oc 

02870 

99 

EXTN 

LDSSTYPE  ENTRY  MAY  INCLUDE  ANY  NUMEER 
OF  TYPES  DESIRED.  AND  MAY  BE  ENTERED  IN 
ANY  ORDER. 

NOTE*  IF  THE  CUMULATIVE  OPERAND  IS  SPECIFIED!*  MONTHS  AND 
TARGET  MUST  ALSO  BE  SPECIFIED.  THE  CUMULATIVE 
OPERAND  MUST  IMMEDIATELY  FOLLOW  THE  COMMAND-NAME » 
MODIFY,  OR  THE  COMM AND-AL IAS,  M.  THE  MONTHS  AND  TARGE 
OPERANDS  MAY  BE  SPECIFIED  IN  ANY  ORDER. 

IF  THE  CUMULATIVE  OPERAND  IS  SPECIFIED,  THE  PERCENT/ 
CONSTANT  AND  LOSSTYPES  OPERANDS  MAY  NOT  BE  SPECIFIED. 

IF  THE  FREQUENCY  OPERAND  IS  SPECIFIED,  THE  MONTHS, 
PERCENT/CONSTANT,  AND  LOSSTYPES  OPERANDS  MUST  ALSO 
BE  SPECIFIED.  THE  FREQUENCY  OPERAND  MUST  IMMEDIATELY 
FOLLOW  THE  COMMAND-NAME,  MODIFY,  OR  THE  COMMAND-ALIAS, 
M.  THE  MONTHS,  PERCENT/CONSTANT,  AND  LOSSTYPES 
OPERANDS  MAY  BE  SPECIFIED  IN  ANY  ORDER. 

IF  THE  FREQUENCY  OPERAND  IS  SPECIFIED,  THE  TARGET 
OPERAND  MAY  NOT  BE  SPECIFIED. 
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DISPLAY 


3010 

03 

3020 

03 

FUNCTION: 

THE  DISPLAY  COMMAND  INITIATES  THE  PRODUCTION  OF  GRAPHS 

03030 

03 

DEPICTING  EITHER  FREQUENCY  DISTRIBUTIONS  OR  CUMULATIVE 

'•'3040 

03 

DISTRIBUTIONS*  OVER  43— MONTH  PERIODS*  OF  USER-SPECIFIED 

3030 

03 

SECTIONS  OF  DATA. 

03  060 

03 

* 

03070 

03 

SYNTAX: 

DISPLAY  CUMULATIVE 

3030 

03 

3090 

03 

OR  s 

TISPLAY  FREQUENCY < ALL/ ' TYPE"* 'TYPE' * . . . 'TYPE') 

03100 

03 

03110 

03 

OPERANDS* 

CUMULATIVE  — ALIAS:  C 

3120 

03 

SPECIFIES  THAT  A GRAPH  OF  A CUMULATIVE  DISTRIBUTION 

•✓3130 

03 

IS  DESIRED. 

03140 

03 

3150 

03 

FREQUENCY  — ALIAS:  F 

3160 

03 

SPECIFIES  THAT  A GRAPH  OF  A FREQUENCY  DISTRIBUTION 

03170 

03 

IS  DESIRED. 

03130 

03 

ALL  — SPECIFIES  THAT  A GRAPH  IS  DESIRED  OF  EACH 

3190 

03 

LOSS-TYPE. 

-3200 

03 

'TYPE'  — AN  ALPHABETIC  ENTRY  SIGNIFYING  A LOSS-TYPE 

03210 

03 

FOR  WHICH  A GRAPH  IS  DESIRED*  TAKEN  FROM 

3220 

03 

AMONG  THE  FOLLOWING: 

3230 

03 

ETS 

03240 

03 

IMRE 

03230 

03 

• 

DFMC 

i 3260 

03 

TRDP  EITHER  FORM 

; J270 

03 

TDP  ACCEPTABLE. 

03230 

03 

MCDT 

- ^290 

03 

UNFT 

• 5300 

03 

PHYD 

03310 

03 

IJNKN 

03320 

03 

OTHR 

t ’.330 

03 

ADMN 

•-  *340 

03 

EXTN 

03330 

03 

0^360 

03 

LOSS-TYPE  ENTRY  MAY  INCLUDE  ANY  NUMBER 

i 370 

03 

OF  TYPES  DESIRED*  AND  MAY  BE  ENTERED  IN 

03330 

03 

ANY  ORDER. 

03390 

03 

C 400 

03 

NOTE: 

IF  CUMULATIVE  IS  SPECIFIED*  FREQUENCY  AND  LOSS-TYPES 

C 410 

03 

MAY  NOT  BE  SPECIFIED. 

03420 

03 

04430 

03 

IF  FREQUENCY  IS  SPECIFIED*  LOSS-TYPES  MUST  BE  SPECIFIED 

f 440 

03 

AND  CUMULATIVE  MAY  NOT  BE  SPECIFIED. 

03430 

03 

03460 

03 

C"‘470 

03 

Q..430 

03 

03490 

03 

05300 

03 

C 310 

99 
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KEEP 


0*01  o 

04 

54020 

04 

FUNCTION 

54030 

04 

04040 

04 

04  050 

04 

14060 

04 

..i40?0 

04 

SYNTftX 

04030 

04 

14090 

04 

ftLIftS 

'4100 

04 

041 1 0 

04 

□PERftNDS 

■•4120 

04 

4130 

04 

■j4140 

04 

04130 

04 

4160 

04 

4170 

04 

04130 

04 

■14190 

04 

4200 

04 

NOTE 

•-•42 1 0 

04 

04220 

04 

4230 

04 

+240 

04 

04250 

04 

*4260 

04 

4270 

04 

'.•4230 

04 

04290 

04 

+300 

04 

+310 

04 

04320 

04 

•4330 

04 

+340 

04 

*4330 

99 

THE  KEEP  COMMftND  ftLLDWS  THE  USER  TO  SftVE  MODIFIED 
VERS  I CHS  CF  ORIGINAL  DftTft.  UP  TC  19  SETS  CF  MODIFIED 
DftTft  MftY  BE  KEPT.  CTS  RIJTQMftT I CftLLY  EXECUTES  ft-  KEEP 
COMMftND  WHENEVER  THE  USER  ISSUES  ft  COHORT  COMMAND. 

KEEP  'NUM'  <' LftBEL O 

K 


'NUM'  — INTEGER  SIGNIFYING  THE  KEEP-SET  WHERE  THE 

MODIFIED  VERSION  OF  THE  DftTft  IS  TO  BE  STORED. 
MUST  BE  IN  THE  RfiNGE:  £1  > HUM  > 0. 

'LftBEL'  — UP  TO  64-  CHftRflCTERS  MftY  BE  ENTERED 

FOR  THE  PURPOSE  OF  USER— D I FFERENT I ftTION 
BETWEEN  VERSIONS  OF  MODIFIED  DftTft  FOR  ft 
SINGLE  COHORT. 

NEITHER  'NUM'  NOR  'LftBEL'  ftRE  REQUIRED?-  THOUGH  EITHER 
OR  BOTH  MftY  BE  SPECIFIED.  IF  'NUM'  IS  NOT  SPECIFIED? 
CTS  WILL  ftSSIGN  THE  LftRGEST  PREVIOUSLY  UNUSED  KEEP-SET 
NUMBER.  IF  ftLL  KEEP-SETS  HftVE  BEEN  USED > CTS  WILL  GIVE 
THE  USER  RN  OPPORTUNITY  TO  RE-USE  ft  KEEP-SET  OF  THE 
USER'S  CHOICE.  IF  'LftBEL'  IS  NOT  SPECIFIED?  CTS  WILL 
ftSSIGN  ft  LftBEL  OF  "NONE”.  THE  LftBEL  FOR  THE  KEEP-SET 
WHICH  IS  USED  BY  CTS  UPON  USER-ISSUftNCE  OF  ft  COHORT 
COMMftND  WILL  CONTftIN  THE  COHORT  COMMftND— STRING  ITSELF? 
ftS  ENTERED  BY  THE  USER. 
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LOAD 


r 


•5010 

05 

5020 

05 

FUNCTIONS 

THE  LORD  CQMNRND  RESTORES  USER-RCCESS  TO  DRTR  WHICH 

■j'5  03  0 

05 

HRS  BEEN  PREVIOUSLY  SAVED  BY  MEANS  OF  R KEEP  COMMAND. 

05040 

05 

5050 

05 

SYNTAX: 

LORD  "NUM" 

5060 

05 

05070 

05 

ALIAS: 

L 

CS080 

05 

5090 

05 

OPERANDS: 

'NUfT  — INTEGER  SIGNIFYING  THE  KEEP-SET  FROM  WHICH 

<,6100 

05 

DRTR  IS  TO  BE  RETRIEVED.  MUST  EE  IN  THE 

05110 

05 

RANGES  21  > NUM  > 0. 

5120 

05 

5130 

05 

NOTE: 

yNUM'  IS  REQUIRED.  IF  NOT  SPECIFIED-.  OR  IF  OUT  OF 

05140 

05 

RANGE r CTS  WILL  REJECT  THE  COMMAND. 

05150 

05 

5160 

05 

.,•5170 

05 

05130 

05 

. 5190 

05 

' 5200 

99 

I ** 


C-ll 


J 


t 


r 


6 010 

06 

• 6 0 0 

06 

FUNCTION: 

THE  REPLACE  COMMAND  ALLOWS  THE  USER  TO 

SUBSTITUTE 

06  03  0 

06 

MODIFIED 

DATA  IN  If  LACE  OF  THE  ORIGINAL 

DATA. 

06040 

06 

16050 

06 

SYNTAX: 

REPLACE 

"YYfifr 

•j6060 

06 

06070 

06 

ALIAS: 

R 

•'6030 

06 

>6090 

06 

OPERANDS: 

"YYMIV  - 

- THE  MONTH  AND  YEAR  AT  WHICH  TO  BEGIN 

06 1 0 0 

06 

REPLACEMENT  OF  ORIGINAL  DATA 

BY  MODIFIED 

06110 

06 

DATA. 

1 6 1 2 0 

06 

J6130 

06 

NOTE: 

" YYMM x I 

S A REQUIRED  ENTRY. 

06140 

06 

16150 

06 

'6160 

06 

06170 

06 

06 1 3 0 

06 

>6190 

99 

HARDCOPY 


07010 

07 

07020 

07 

FUNCTIONS 

THE  HARDCOPY  COMMAND  ALLOWS  THE  USER  T.O  CONTROL  THE 

07030 

07 

WRITING  OF  GRAPHS  AND  ACCEPTED  COMMANDS  TO  THE  PRINT 

07040 

07 

FILE. 

07050 

07 

07060 

07 

SYNTAX: 

HARDCOPY  ON^OFF/ONLY 

07070 

07 

07030 

07 

ALIAS: 

H 

07090 

07 

07100 

07 

OPERANDS: 

ON  — CAUSES  GRAPHS  TO  BE  WRITTEN  BOTH  ON  THE  TERMINAL 

07110 

07 

AND  TO  THE  PRINT-FILE. 

37120 

07 

OFF  — CAUSES  GRAPHS  TO  BE  WRITTEN  ON  THE  TERMINAL  ONLY 

07130 

07 

AND  NOT  TO  THE  PRINT-FILE. 

07140 

07 

ONLY  — CAUSES  GRAPHS  TO  BE  WRITTEN  TO  THE  PRINT-FILE 

07150 

07 

ONLY f AND  NOT  ON  THE  TERMINAL. 

37160 

07 

07170 

07 

NOTE: 

THE  SETTINGS  ARE  MUTUALLY  EXCLUSIVE — ONLY  ONE  FORM  IS 

07130 

07 

ACCEPTABLE  PER  ISSUANCE  OF  THE  HARDCOPY  COMMAND. 

37190 

07 

07200 

07 

07210 

07 

37220 

07 

37230 

07 

07240 

07 

07250 

c»c* 
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PARAMS 


'■3010 

03 

:?  oa  0 

03 

FUNCTION* 

THE 

PARAMS  COMMAND  PROVIDES  THE  USER  WITH-  THE  FOLLOWING 

03030 

03 

INFORMATION: 

03040 

08 

3050 

03 

1. 

CURRENT  YEAR  AND  MONTH 

-•3060 

03 

03070 

03 

2. 

YEAR  & MONTH  OF  FIRST  COHORT  IN  HISTORY  FILE 

'3030 

03 

3090 

03 

3. 

NUMBER  OF  MONTHS  OF  HISTORY 

03100 

08 

03110 

08 

4. 

NUMBER  OF  MONTHS  OF  PROJECTION 

3120 

03 

.3130 

08 

5. 

NUMBER  OF  CHARACTERISTIC-GROUPS*  2-YR  TERM 

03140 

03 

3-YR  TERM 

‘3150 

03 

4-YR  TERM 

3160 

03 

5-YR  TERM 

03170 

03 

6-YR  TERM 

03130 

03 

3190 

03 

SYNTAX: 

PARAMS 

3200 

03 

03210 

03 

ALIAS: 

P 

"3220 

03 

3230 

03 

□PERBNDS: 

NONE 

03240 

03 

03250 

03 

3260 

03 

.3270 

03 

03230 

03 

- 3290 

03 

3300 

99 

1) 

t j 


■ 


SCALE 


09010 

09 

'9020 

09 

FUNCTION 

*030 

09 

O9040 

09 

09050 

09 

9060 

09 

.9070 

09 

SYNTAX 

09030 

09 

•'9090 

09 

ALIAS 

9100 

09 

U91 10 

09 

OPERANDS 

09120 

09 

9130 

09 

.9140 

09  ’ 

09150 

09 

•9160 

09 

9170 

09 

09180 

09 

09190 

09 

9200 

09 

9210 

09 

09220 

09 

-9230 

09 

9240 

09 

0925 0 

09 

09260 

09 

NOTE 

9270 

09 

.9230 

09 

09290 

09 

*9300 

09 

9310 

09 

09320 

09 

09330 

09 

9340 

09 

. 9350 

09 

09360 

09 

'9370 

99 

THE  SCALE  COMMAND  PROVIDES  THE  USER  THE  OPTION  OF 
SPECIFYING  THE  FREQUENCY  AND  CUMULATIVE  SCALES  TO 
BE  USED  IN  THE  CONSTRUCTION  OF  GRAPHS  OF  DATA.  OR 
TO  SET  SCALES  OFF  <VALUE=0. 0> . 

SCALE  FREQUENCY/CUMULATIVE  OFF/"YMAX" 

S 

FREQUENCY  — ALIAS:  F 

SPECIFIES  THAT  THE  SETTING  SPECIFIED  IS  TO  BE 
APPLIED  TO  GRAPHS  GENERATED  USING  FREQUENCY  DATA. 

CUMULATIVE  — ALIAS:  C 

SPECIFIES  THAT  THE  SETTING  SPECIFIED  IS  TO  BE 
APPLIED  TO  GRAPHS  GENERATED  USING  CUMULATIVE  DATA. 

OFF  — SPECIFIES  THAT  A SCALE-FACTOR  IS  TO  BE  RESET 
TO  THE  VALUE  OF  ZERO. 

" YMAX " — A REAL-NUMBER  SIGNIFYING  THE  SCALE-FACTOR 

DESIRED.  MUST  BE  IN  THE  RANGE:  1.0  > YMAX  > 0. 
DECIMAL  POINT  IS  REQUIRED. 

EITHER  FREQUENCY  OR  CUMULATIVE  MUST  BE  SPECIFIED. 

BOTH  MAY  NOT  BE  SPECIFIED. 

EITHER  OFF  DR  "YMAX"  MUST  BE  SPECIFIED. 

BOTH  MAY  NOT  BE  SPECIFIED. 
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i 

I 

l i. 


! 


MODIFY  COMMAND  IMPLEMENTATION 

Frequency  Array  Modification 

given:  range  of  months  of  service,  m^  -*  m ^ 
loss  type 

percent  or  constant  change 

(1)  Make  specified  changes. 

(2)  Check  to  see  if  next  command  is  another  frequency  array  modification. 
If  Yes,  go  to  1;  if  No,  go  to  3. 

(3)  Recompute  frequencies  for  "ALL"  losses. 

(4)  Recompute  cumulative  loss  array. 

Cumulative  Vector  Modification 


given:  range  of  months  of  service,  m^— » m^ 

Target  loss  rate,  T,  and  associated  Month  of  Service,  K 


(1)  Make  changes  as  follows: 

• Find  Si=l-CUMi  for  all  i> 

/» 

• Find 


-0 

• Find  F=*  TTTZ — JIT  ZTT 

fs  \1/Cm2-ffil+1) 

1- 


• Find  S±-S  (l-D+SjF?  for  m^iSmj-l 

• Find  C^=l-S^for  all  i>m^ 

(2)  Recompute  frequencies  for  "ALL  LOSSES" 

(3)  Recompute  frequencies  for  individual  loss  types,  excluding  ETS  losses 


C-16 


APPENDIX  D 


STRUCTURE  SPACES  AS  VARIABLES 

STATEMENT  OF  THE  PROBLEM 

The  Army  desires  the  capability  to  treat  structure  spaces  as  linear 
programming  variables.  This  capability  would  allow  the  analyst  to  control 
the  input  of  structure  spaces,  or  targets,  within  the  matrix  generator,  and 
subsequently  in  the  LP. 

REQUIRED  SYSTEM  CAPABILITY 

When  optimizing  on  0BJ1  or  0BJ2,  COMPLIP  minimizes  the  weighted  sum 
of  the  absolute  values  of  the  deviations  from  the  target  structure  spaces. 

0BJ2  has  two  parts,  acceptable  and  excessive  deviations.  The  excessive 
deviations  are  weighted  more  heavily.  When  making  the  target  structure 
spaces  a variable  in  each  projection  month,  they  will  adjust  themselves — 
within  the  upper  and  lower  bounds — to  make  the  deviations  even  closer  to  zero. 

When  the  range  of  the  lower  and  upper  bounds  is  sufficiently  great  in 
each  month,  all  the  deviations  would  likely  be  computed  as  zero  by  adjusting 
the  variable  target,  hence  computing  an  "idealized"  target — idealized  in 
the  sense  of  minimum  deviations.  Whether  this  "idealized"  target  is  useful 
in  the  context  of  the  manpower  program  is  another  question. 

It  is  also  conceivable  that  with  the  lower  and  upper  bound  range 
sufficiently  large,  COMPLIP  would  adjust  the  variable  target  to  either 
extreme  in  order  to  minimize  deviations  rather  than  adjust  the  accessions. 

This  should  certainly  be  checked  by  making  experimental  runs.  The  user  must 
determine  acceptable  limits  on  the  variable  target  in  order  to  produce  a 
reasonable  manpower  program  and  adjusted  structure  spaces. 

METHODOLOGY 

The  following  section  presents  the  source  code  modifications  to  the 
matrix  generator  which  were  delivered  to  USAMSSA  to  be  implemented  and  tested 
by  them. 
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In  Subroutine  EDIT  the  user  table  TARG  must  be  re-defined  to  contain 
two  additional  columns  with  column  names  FAC1  and  FAC2.  Column  FAC1  has 
for  each  month  the  fraction  of  the  target  in  column  RHS  (the  only  column  now 
in  TARG)  that  becomes  the  lower  bound  on  the  new  variable  TAR...11. 
Similarly,  the  column  FAC2  contains  the  values  for  the  upper  bound.  Typical 
values  in  FACl  and  FAC2  would  be  .95  and  1.05  respectively.  That  is,  values 
in  FACl  are  less  than  or  equal  to  1.0,  and  in  FAC2  they  are  greater  than  or 
equal  to  1.0. 

Let  t represent  the  index  in  array  TNAME  for  table  TARG  in  EDIT.  Then 
the  following  change  is  in  order: 

DATA  NCOL  (t)  / 3 / 

In  MATGEN,  the  sections  of  the  code  corresponding  to  E.AVDlii  and 
E.AVD2ii  need  to  be  changed.  The  code  for  U.DAC.ii  need  not  be  changed. 

C E.AVDIII  *** 

C IF  (FOUND.  . . 

IF  (.NOT.  (FOUND  (4H0BJ1,.  . . 

CALL  FETCH  (4HTARG,  ARRAYU,  . . . 

JOFSET  - 3+J0RGU,  LIM1,  LIM2) 

KOFSET  - 3+JORG(0,  LIM1,  LIM2) 

J20FST  - 3+JORG(2,  LIMl’,  LIM2) 

J30FST  - 3+J0RG ( 3 , LIMl,  LIM2) 

CALL  ENDTAR (ARRAYU (KOFSET+1) , ARRAYU (J0FSET+1,M,LIM2) 

CALL  ENDTAR(ARRAYU (KOFSET+1) , ARRAYU (J20FST+1,M,LIM2) 

CALL  ENDTAR (ARRAYU (KOFSET+1) , ARRAYU (J30FST+1,M,LIM2) 

CALL  EAVD1 (ARRAYU (JOFSET+1) ,ARRAYU(J20FST+1) , 

* ARRAYU (J30FST+1) ,A1,LIM2) 
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for  ***  E.AVD2II  ***  the  coding  changes  are  similar: 


J20FST  - . . . 

J30FST  = . . . 

CALL  ENDTAR  . . . JOFSET+1,  . . . 

" ...  J20FST+1,  . . . 

" ...  J30FST+1,  . . . 

CALL  EAVD2 (ARRAYU (JOFSET+1) ,ARRAYU(J20FST+L) ,ARRAYU(J30FST+1) 
* A1.LIM2) 


The  changes  to  subroutines  EAVD1  and  EAVD2  are  exactly  the  same.  Each 
has  three  line  changes,  the  calling  sequence,  a type  statement  and  the  CALL 
RHS2 ( . . .).  That  is: 

SUBROUTINE  EAVD  {*}  (TAR,FAC1,FAC2,A1,M5) 

• • • 

REAL  TAR  (M5  ) , FAC1  (M5  ) , FAC2  (M5  ) 


replace  CALL  RHS2  (...)  with 


CALL  COL22(4HTAR. ,2H. . ,1, 
CALL  BND2 (2HL0 , 4HTAR . , 2H . 
CALL  BND2(2HUP,4HTAR. ,2H. 


4HE . AV , 2 HD 
. ,1 ,FAC1 (I 


3 


) 


. ,I,FAC2(I) ) 


.1. 


* -TAR(I) ) 


where 
changed . 


means: 


use  1 when  EAVD1  is  changed  and  use  2 when  EAVD2  is 
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APPENDIX  E 
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MATRIX  GENERATOR  ENHANCEMENTS 

STATEMENT  OF  THE  PROBLEM 

The  Army  desires  the  capability  within  the  matrix  generator  to  ex- 
press linkages  between  the  various  characteristic  groups  as  ratios  or  per- 
centage of  each  other  (e.g, , Mental-Category-IV  High-School-Diploma-Graduate 
equals  10  percent  of  all  High-School-Diploma-Graduate  Accessions  during  a 
particular  projection  period).  In  addition  to  this  enhancement,  the  capabi- 
lity to  equate  one  of  the  characteristic  groups  to  WAC  females  and  to 
accommodate  the  special  constraints  of  the  budget  year  by  developing  different 
capabilities  for  the  budget  year  and  the  remaining  years  will  be  new 
capabilities  within  the  matrix  generator. 


REQUIRED  SYSTEM  CAPABILITY 

The  existing  ELIM-COMPLIP  system  has  been  modified  to  provide  the  user 
with  the  capability  within  the  matrix  generator  to  specify  the  accession 
level  of  first-term  enlistments  of  one  characteristic  group  as  a fraction  of 
its  accession  level  of  another  characteristic  group  and  the  capability  to 
equate  one  of  the  characteristic  groups  to  user-supplied  WAC  females, 

I 

i The  design  and  source  code  modifications  necessary  to  incorporate  these 

new  features  into  COMPLIP  have  been  made  and  delivered  to  USAMSSA  to  be 

I implemented  and  tested  by  them.  Once  COMPLIP  has  been  positively  tested,  the 

modified  matrix  generator  code  must  be  incorporated  with  the  modifications 
made  by  GE  TEMPO  for  the  training  model, 

METHODOLOGY 

The  following  represent  the  source  code  modifications  which  allow 
linkages  between  the  various  characteristic  grops,  one  characteristic  group 
to  be  equated  to  WAC  females,  and  the  special  constraints  for  the  budget 
year  as  new  capabilities  of  the  matrix  generator. 

In  MGEDIT 

UPDATE  TOBJ  to  accommodate  type  4 tables  with  two  digit  numerals  up  to 

the  number  24  (e.g. , CG  01,  02,  , 16;  CG  21,  22,  23,  24).  Skip  17.  18. 

19,  20. 


E-l 


New  User  Tables 


I 

V 

I 


* 


Proportionality  Constraints  Relating  Group  1 to  Group  2 

TABLE  PROF 

FAC1  FAC2  . . . FAC6 

MONFY 


Proportionality  Constraint  set  1 
GRP1  - FAC1  * GRP 2 

TABLE  PR01 

GRP1  GRP2 

01  1.0  0.0 

02  1.0  0.0 


• 

• 

• 

• 

• 

• 

• 

16 

• 

1.0 

• 

0.0 

21 

0.0 

1.0 

22 

0.0 

0.0 

23 

1.0 

0.0 

24 

0.0 

0,0 

1 
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Constraint  set  2 
GRP1  - FAC2  * GRP  2 


TABLE 

01 

PR02 

GRP1 

1.0 

GRP  2 

0.0 

02 

1.0 

0.0 

• 

• 

• 

• 

• 

• 

• 

• 

• 

16 

' 1.0 

0.0 

21 

0.0 

1.0 

22 

0.0 

0.0 

23 

1.0 

0.0 

24 

0.0 

0.0 

TABLE 

e-g- 

PR06 

, PRO 4(GRP1)  ii-PROF  (FACJ2)  *PRO/(GRP2) 

COMPLIP  Equation 


EYROPiil 


NCG34  NCG256  NCG34  NCG256 

- yFTQCjjii  -YFTQ2jjil  + f ...  ^ FTQCjjil  + f ...  2 FTQ2jjii  -0 

11A  jj-1  ji-1 

U&2JI 


jjeP 


2j2 


jjeP 


2J2 


j 

jjeP 


2J2 


Where  P^  are  defined  as  follows: 


P^»  set  of  Characteristic  Groups  that  are  to  be  proportional  to  a 


second  set  P^^g ; the  proportionality  constant  being  e*8* 

refers  to  columr 
Table  PR02,  and 


refers  to  column  GRP1  in  Table  PROl,  ?2^  refers  to  column  GRP2  in 

refers  to  FACT  for  month  ii  in  Table  PROF. 


I 
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Code  Modifications 


MATGEN;  At  front  in  MATGEN 
INTEGER  PROP (6) 

t 

DATA  PR0P/4HPR01,  4HPR02,  4HPR03,  4HPR04,  4HPR05,  4HPR06/ 

Insert  the  following  just  before  "CALL  CURTIS (...)" : 

C ***  EPROP  ***  TWO  SETS  OF  CG  PROPORTIONAL  TO  EACH  OTHER 
DO  2190  L=l,6 

IF  (.NOT.  (FOUND (PROP (L),  DIRU,  LDIRU) ) ) GOTO  2190 
FETCH  (UHPROF,  ARRAYU,  LNU,  BUFU,  LBUFU,  DIRU,  LDIRU, 

LUNU,  MSGFIL) 

JOFSET  =>  3+JORG  (L,  LIM1,  LIM2) 

C move  proportionality  constant  into  W0RK1 

CALL  MOVER  (W0RK4 (LIMI) , ARRAYU  (JOFSET  + LIM1) , 1 - LIM1  + LIM2) 
LIM1F  =*  LIMI 
LIM2F  = LIM2 

DO  2090  II  » LIM1F,  LIM2F 
IROW  - L*100  + II 

CALL  R0W3  (1HE,  4HEPRO , 1HP,  IROW) 

FETCH  (PROP (L) , ARRAYU,  LNU,  BUFU,  LBUFU,  DIRU,  LDFRU , LUNU,  MSGFIL) 
JOFSET  = 3 + J0RG(L,  LIMI,  LIM2) 

KOFSET  = 3 + J0RG(2,  LIMI,  LIM2) 

C CG  1-16  TERMS  3,4 

DO  2060  ICG34  = 1,  NCG34 

C ARAYU  IS  A REAL  ARRAY  EQUIV  TO  THE  INTEGER 
C ARRAY  ARRAYU 

IF  (ARAYU  (JOFSET  + ICG34)  . LE.O.O)GOTO  2050 

CALL  COL23  (4HFTQC,  FTQC2  (ICG34) , II,  4HEPRO , 1HP,  IROW,  -1.0) 

2050  IF  (ARAYU  (KOFSET  + JCG34)  .LE.O.O)GOTO  2060 

CALL  COL23  (4HFTQC , FTQC2  (ICG34) , II,  4HEPR0 , 1HP,  IROW, 

- W0RK4  (II)) 

2060  CONTINUE 
C 

C CG  21,  22,  23,  24,  TERMS  2,5,6 
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APPENDIX  F 

AN  OVERVIEW  OF  CURRENTLY  AVAILABLE 
FORECASTING  MODELS 


This  paper  provides  an  overview  of  statistical  forecasting  models. 

The  objective  is  to  provide  an  understanding  of  the  strengths  and  limita- 
tions of  what  is  currently  available  in  this  field  rather  than  an  analytical 
treatment  of  any  specific  model.  While  the  various  general  classes  of 
models  are  mentioned,  the  major  emphasis  is  on  time  series  extrapolative 
models  as  these  would  seem  most  appropriate  for  the  forecasting  problem 
being  addressed  by  General  Research.  Annex  FI  contains  bibliography  of  the 
relevant  literature. 


F-l 


I 


Quantitative  forecasting  models  have  received  consider- 


able attention  in  recent  years;  analytic  endeavors  can  be 
stratified  into  tnree  primary  groups:  survey  and  market 
research  methods , time-series  (extrapolative)  methods,  and 
causality  methods . (A  fourth  group  might  include  methods 
oriented  towards  the  systematic  development  of  a consensus 
of  expert  opinion;  the  Delphi  method  is  most  familiar  in 
this  regard.)  The  three  data-based  groups  have  been  developed 
for  use  in  rather  distinct  problem  settings;  although  there 
is  occasional  debate  regarding  model  choice  across  the 
groupings,  most  frequently  models  within  a single  group  are 
suggested  as  appropriate  in  any  given  problem. 

Survey  and  market  research  methods  of  forecasting  are 
most  generally  applied  to  problems  in  which  doubt  exists 
as  to  the  existence  of  a credible  data  base.  Typical  appli- 
cations of  such  methods  x^ould  encompass  such  decisions  as 
new  product  introduction  or  the  outcome  of  an  election. 
Implicit  in  the  choice  of  these  methods  is  the  assumption 
that  the  problem  under  study  is  sufficiently  unique  to  war- 
rant independent  analysis.  Survey  and  market  research  methods 
closely  parallel  standard  methods  of  statistics,  classical 
or  Bayesian.  They  involve  the  formulation  of  the-,  problem 
(e.g.,  the  unknowns  to  be  estimated  or  hypotheses  to  be 
tested)  , experimental  design  (e.g.  , what  types  and  quantity 
of  data  to  collect),  actual  data  collection,  and  analysis. 
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Forecasting  is  a direct  outcome  of  such  endeavors,  in  the  sense 
that  the  entire  effort  is  direcced  towards  the  relevant  "un- 
knowns" in  the  decision  environment.  The  primary  drawbacks 
of  methods  within  this  broad  category  are  expense  and  their 
"one-time"  value.  As  a result,  typical  applications  are  gener- 
ally restricted  to  non-recurring  problems  of  significant  con- 
sequence . 

Causal  models  are  applied  to  problems  in  which  a depen- 
dent variable  (or  a set  of  dependent  variables  simultaneously) 
is  believed  related  to  a set  of  independent  variables,  with 
random  errors  intruding  on  the  underlying  structural  rela- 
tionship. The  initial  impetus  for  the  development  of  models 
within  this  category  was  provided  by  various  economic  fore- 
casting problems.  Most  familiar  of  the  causal  models  is  re- 
gression analysis,  in  which  a dependent  variable,  y,  is 
believed  linearly  related  to  a set  of  independent  variables, 

Xi  , . . . , Xtl,  by  the  relation 

y = 30+BiXl-r...  + BnXn  + U , 

where  the  B's  are  unknown  coefficients  and  n is  an  unobserved 
error  term.  This  single-equation  model  is  readily  extended 
to  simultaneous -equation  versons  within  which  a set  of  depen- 
dent variables,  yt,  ...,  ym  are  jointed  determined  within 
a set  of  m equations , each  potentially  involving  both  depen- 
dent (y)  variables  and  independent  (x)  variables  as  well  as 
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an  error  term.  Forecasting  with  causal  models  involves  two 
stages  of  analysis:  the  interpretation  of  the  unknown  para- 
meters of  the  model  from  data  on  the  y's  and  x's,  and  the 
preparation  of  forecasts  from  the  estimated  model.  Subsid- 
iary analytic  efforts  are  often  required  in  the  latter  stage, 
for  example,  when  the  independent  variables  themselves  are  not 
known  with  certainty.  Forecast  accuracy  with  causal  models 
are  governed  by  three  interrelated  factors.  The  quality  of 
the  estimates  of  the  unknown  model  parameters,  the  relative 
magnitudes  of  the  structural  and  random  components  of  the 
model,  and  the  degree  of  certainty  regarding  the  inputs  to 
the  model  when  it  is  used  for  forecasting. 

The  econometric  literature  has  supplied  a wide  variety 
of  statistical  proceedings  for  most  causal  models  of  both 
the  single  and  simultaneous  equation  types.  Given  that  a 
causal  model  can  be  postulated  (i.e.,  the  relevant  indepen- 
dent variables  identified)  and  the  existence  of  a credible 
data  base,  standard  procedures  exist  for  both  model  estima- 
tion and  forecasting.  These  procedures  generally  allow 
inferences  to  be  drawn  about  both  the  adequacy  of  the  model 
in  explaining  historical  data  and  the  accuracy  of  forecasts 
made  using  the  model.  When  causality  exists  and  is  identi- 
fied, models  drawn  from  this  general  class  are  clearly  ap- 
propriate. Efforts  of  this  type  are  generally  more  durable 
than  those  previously  discussed;  while  considerable  one-time 
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expense  may  be  incurred  in  formulating  and  estimating  causal 
models,  they  are  easily  updated  for  repeated  application. 


The  third  category  of  analytic  models,  time-series  or 
extrapolative  models,  applies  to  problems  in  which  future 
values  of  a variable  are  to  be  predicted  using  prior  obser- 
vations on  the  same  variable  as  a basis.  Extrapolative 
models,  as  well  as  being  important  in  their  own  right,  are 
often  substituted  for  causal  models.  Among  the  earliest 
applications  of  these  models  was  to  the  problem  of  sales 
forecasting.  This  application  suggests  an  important  caveat 
in  the  use  of  extrapolative  models.  Sales,  in  most  cases, 
can  be  said  to  be  "caused"  by  such  factors  as  pricing  policy, 
advertising,  etc.  Using  previous  sales  figures  to  forecast 
future  sales  may  be  reasonable  if  pricing  and  advertising 
policy  have  remained  unchanged;  on  the  other  hand,  past 
sales  figures  may  be  a poor  guide  to  the  future  when  the 
causing  variables  are  changing. 

The  recent  spurt  of  development  efforts  regarding  extra- 
polative models  has  stemmed  from  an  attempt  to  retain  the 
parsimony  inherent  in  this  class  of  models  while  avoiding 
the  need  to  implicitly  assume  an  unchanging  causal  environ- 
ment. The  former  feature  is  clearly  one  worth  retaining: 
extrapolative  models  generally  require  the  minimum  attention  • 
for  repeated  application,  drawing  only  upon  the  variable  of 
interest  for  updating  and  forecasting.  Only  the  initial 
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effort  of  choosing  among  extrapolative  models  is  noteworthy 
in  terms  of  expense  or  difficulty.  The  latter  factor  --  the 
true  causal  environment  --  is  the  key  drawback  to  models 
within  this  category.  Extrapolative  models  cannot  be  ex- 
pected to  perform  well  when  conditions  change  dramatically. 

A corollary  to  this  drawback  is  that  those  models  can  be 
expected  to  perform  best  in  short-term  forecasting,  since 
extrapolative  forecasts  of  the  distant  future  implicitly 
assume  a stable  environment. 

The  remainder  of  this  paper  discusses  various  extra- 
polative models.  Section  2 outlines  the  simplest  such  models 
and  expands  upon  the  discussion  of  this  section.  Section  3 
extends  these  models  to  consider  various  additional  regulari- 
ties in  data  which  such  models  can  accomodate.  Section  4 
describes  more  advanced  methods  developed  in  recent  years 
which  incorporate  most  of  the  earlier  efforts;  these  models 
require  initial  analytic  efforts,  but  retain  the  quality  of 
parsimony  once  developed.  Finally,  Section  5 presents  a 
summary  and  suggestions  for  consideration  in  the  present 
application. 

2 . Simple  Extrapolative  Models 

In  this  section,  the  general  estimation  problem  to  be 
considered  is  chat  of  determining  the  future  values  of  some 
variable  x from  a time  series  of  observations , x, , x2 , ....  x 
on  the  past  values  of  this  same  variable.  The  observations 
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:<i,  Xp  are  ordered  chronologically,  beginning  with  che 

value  observed  in  che  earliest  period,  X; , and  progressing 
Co  Che  nose  recently  observed  value,  . 

The  most  frequently  employed  extrapolative  estimation 
techniques  fall  generally  into  the  category  of  weighted 
averages  of  the  previous  history  of  observations.  A weighted 
average  forecast  of  the  variable  x,  denoted  x,  based  upon  the 
series  of  observations  xx , x2 , . . . , xT  may  be  written  as 

X = WjXj  -r  W2X2  + . . . + WTXT 

— > 

where  wx,  w2,  . . . , w^  are  weights  applied  to  che  previous 
observations.  Typically  these  weights  are  constrained  to 
sum  to  one: 

wi  + w2  + • • • + - I- 

Simple  and  moving  average  estimators  both  fall  within  this 
general  class.  A simple  average  estimator  is  one  in  which 
the  weight  assigned  to  each  past  observation  is  equal  to 
1/T.  Thus  the  simple  average  estimator  is 


x = (1/T) Xj  + ( 1/T) x2  + ...  + (1/T)xt 

= Xt  + X,  + . . . + xT 
T 


A moving  average  estimator  of  length  N assigns  weights  of 
(1/N)  to  the  most  recent  N observations  and  weights  of  zero 
to  the  earliest  (T-N)  observations.  Thus  a moving  average 
estimator  of  length  N is 


a. 


= (0)x1  + ...  + (0)xT_N  + (l/^)xr_N+1  + ...  + (1/N)xt 


xT-N+l 


+ . 


+ xT 


N 


1 

N 


T 

Z 

i-T-N+1 


For  example,  a moving  average  estimator  of  length  N=3  would 
be  defined  as 

% _ *T-2  + *1-1  + *T 

— _ 

3 

To  examine  the  properties  of  these  two  estimators,  con- 
sider the  following  simple  process  by  which  the  actual  values 
of  the  observations  might  be  generated: 

x.  = a + u.  , 

where  a is  an  unknown  constant  and  u„.  is  a random  disturbance 
with  a mean  of  zero  (i.e.,  E(u^)  = 0)  and  a variance  of  c2 
(i.e.,  Var  (u^)  = a2)  that  affects  each  period's  observation 
independently.  The  observations,  x,.  , will  thus  tend  to  clus- 
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cer  around  che  value  a,  with  Che  spread  determined  by  the 
variance  of  che  random  disturbance.  For  chis  simple  pro- 
cess, it  can  be  shown  thac  boch  che  simple  and  moving  average 
estimators  defined  above  have  expected  values  equal  to  the 

A 

constant  a (i.e.,  E(x)  = a);  thus,  on  average,  both  will 
tend  towards  the  deterministic  component  of  the  process 
generating  the  data.  The  variances  of  the  two  estimators, 
however,  differ.  For  the  simple  average  estimator,  Var  (x)  = 
a2/ T,  while  for  the  moving  average  estimator  of  length  N, 

Var  (x)  = a2/N.  Since  the  length  of  che  full  series  exceeds 
the  length  of  any  moving  average  estimator  (T>N) , the  vari- 
ance of  the  simple  average  estimator  will  be  smaller.  From 
these  calculations,  the  general  inference  might  be  drawn 
that  a simple  average  estimator  dominates  a moving  average 
estimator  when  the  underlying  process  generating  the  obser- 
vations is  stable  from  period  to  period  (other  than  an  unpre- 
dictable component  such  as  the  disturbance  u. ) . 

This  conclusion,  however,  is  altered  when  the  underly- 

\ 

ing  process  is  less  stable.  For  example,  if  for  some  rea- 
son the  constant  value,  a,  shifts  to  a different  value,  b, 
after  some  time  t=M  in  the  process  (i.e., 


b + u^  for  t > M, 
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where  the  disturbances  are  as  before) , then  it  can  be 
shown  that  the  expected  values  of  the  simple  and  moving 
average  estimators  are  no  longer  the  same.  The  expected 
value  of  the  simple  average  estimator  can  be  determined  to 
be 

E(x)  - . 

Thus,  future  estimates  using  the  simple  average  estimator 
will  be  forever  biased  away  from  b as  a result  of  the  inclu- 
sion of  the  data  from  the  first  M periods.  A moving  average 
estimator  of  length  N,  on  the  other  hand,  will  have  as  its 
expected  value  E(x)  = b after  period  M+N  (i.e.  , after  the 
transitional  period).  Therefore,  the  general  inference  may 
be  drawn  that  the  moving  average  estimator  adjusts  better 
than  the  simple  average  estimator  to  changes  in  the  under- 
lying process. 

The  above  simple  examples  suggest  two  generally  desir- 
able properties  of  extrapolative  estimators:  the  ability 
to  adjust  to  structural  changes  that  occur  in  the  under- 
lying process  and  stability  under  the  condition  that  obser- 
vations may  merely  reflect  random  disturbances  rather  than 
changes  in  the  actual  process.  As  suggested  above,  the 
moving  average  estimator  is  preferable  with  regard  to  the 
first  property  and  the  simple  average  is  preferable  with 
regard  to  the  second. 
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However,  since  both  the  presence  of  random  disturbances 
and  the  possibility  of  structural,  change  are  typically  pre- 
sent in  most  environments  to  which  such  estimation  models 
might  be  applied,  neither  of  these  estimators  is  entirely 
satisfactory. 

A third  weighted  average  technique , known  as  exponen- 
tial smoothing,  combines  the  concepts  underlying  the  two 
averaging  techniques  discussed  above  in  that  it  both  places 
weight  upon  all  the  past  observations  in  making  forecasts 
(as  does  the  simple  average)  and  weights  more  heavily  data 
from  the  most  recent  periods  (as  do  moving  average  models). 
The  exponential  smoothing  technique  places  weight  w^  = 

X(l-X)  , where  X is  a number  between  zero  and  one,  on 
each  observation  x^  and  thus  can  be  written  as 

x = X(l-X)T_1x1  + X(1-X)T-2x2  + ...  + X(l-X)xT_1  + XxT  . 

The  weights  applied  to  the  observations  thus  decline  geome- 
trically from  the  most  recent  observation  to  the  earliest 
observation. 

As  the  length  of  the  series  of  past  observations  grows, 
the  sum  of  the  weights  approaches  one.  In  fact,  since  for 
any  reasonable  large  value  of  T the  weights  applied  to  early 
observations  are  small  (e.g.,  for  T=20  and  X=0.5,  the  weight 
w^  applied  to  the  first  observation  is  0.00000095),  the  time 
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series  will  be  considered  to  be  arbitrarily  long.  The  expo- 
nential smoothing  equation  can  then  be  re-written,  using  sum- 
mation notation,  as 

co 

x = I X(l-X) 1x_  . 

i=0  1 • 

This  equation  can  be  re-written  in  several  useful  ways. 

First,  by  separating  the  term  applying  to  the  most  recent 
observation  (i.e.,  XXp)  , the  remaining  terms  can  be  rewrit- 
ten as 

OO 

2 X(l-X)1xT  . 
i=l  i'1 


- (1-X)  E 

X(l-X)i“1xT_i 

i=l 

00 

= (l-x)  E 

X(l-X)  Xr£_^_^ 

i=0 

= (l-X)x  , 

l 

where  x_^  is  the  exponential  smoothing  forecast  after  T-l 
observations  (but  before  the  T^  observation)  . The  exponen- 
tial smoothing  equation  can  then  be  rewritten  as 

x = Xxj,  + (l-\)x_^ 
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i.e.,  the:  predicted  value  x is  a weighted  average  of  the 
previous  prediction  x_-^  and  the  most  recent  observation  x^. 
The  choice  of  a smoothing  constant  (A)  close  to  zero  places 
greater  weight  on  the  previous  forecast  than  on  the  most 
recent  observation;  conversely,  a value  of  A close  to  one 
places  relatively  greater  weight  on  the  most  recent  data. 

A simple  rearrangement  of  the  above  equation  suggests 
a third  interpretation  of  the  technique.  Rewriting  the 
equation  as 

x = Axt  + (1-A)x_1 

'Xj  'Xj 

= Axt  + x_-^  - Ax_^ 

= x_L  + A(xt-x1) 

suggests  that  the  new  forecast  is  obtained  by  revising  the 
previous  forecast  x_^  by  some  fraction  A of  the  error  (x,-,-x_^ 
between  the  actual  observation  and  the  previous  forecast. 

The  choice  of  a value  for  the  smoothing  constant,  A, 
like  the  choice  between  a simple  and  moving  average  model, 
depends  in  part  upon  the  trade-off  between  stability  in  the 
presence  of  random  disturbances  and  the  speed  of  adjustment 
to  changes  in  the  underlying  process.  A low  value  of  A im- 
plies a preference  for  the  former  characteristic  in  the  esti- 


F-13 


mator;  a higher  value  of  A implies  a preference  for  the  latter. 
In  practice,  one  might  choose  a value  by  dividing  the  time 
series  of  observations  into  two  parts,  constructing  exponen- 
tial smoothing  estimators  for  various  values  of  A on  the 
first  part  of  the  series,  and  choosing  the  value  that  fore- 
casts best  (e.g.,  in  terms  of  average  absolute  error,  average 
squared  error,  (or  Theil's  U-statistic) , on  the  second  part 
of  the  series. 

The  extreme  simplicity  of  these  models  allows  very  effi- 
cient computer  implementation.  Once  a particular  form  (e.g., 
length  of  moving  average  or  vaJjue  of  A)  is  selected,  minimal 
storage  is  required.  A moving  average  requires  storing  only 
the  most  recent  N observations;  exponential  smoothing  only 
the  most  recent  forecast  and  the  current  observation. 


Inclusion  of  Regularities  in  Extrapolative  Models 


Many  data  series  of  interest  can  be  decomposed  into 
identifiable  elements:  a permanent  component,  a trend  compo- 
nent, and  a periodic  (seasonal  or  cyclical)  component.  Decom- 
position methods  allow  the  extraction  of  these  components 


separately  from  the  time  series  for  use  in  preparing  a series 


of  forecasts.  Such  methods  can  be  applied  by  a minor  exten- 


sion of  the  exponential  smoothing  models  described  in  Section 


2.  The  extension  essentially  requires  only  the  use  of  three 
parallel  smoothing  models  --  one  fcr  estimating  each  compo- 


i 


5 


F-14 


nent  of  the  model.  Letting 


Xt  = actual  observation  in  period  t 
X = estimated  permanent  component  for  period  t 
Sc  = estimated  periodic  index  for  period  t 
At  = estimated  trend  for  period  t 
L = number  of  periods  in  a cycle,  and 
= smoothing  coefficients , 

the  decomposition  model  can  be  developed  as  follows.  First, 
the  permanent  component  X is  estimated: 

y 

Xt  - + (1-XL)  <Xt-1  + At_1). 

t — L 


The  estimate  is  a weighted  average  of  the  observed  (deseason- 
alized)  value  and  the  previously  predicted  permanent  value. 
Next,  periodic  indices  are  updated: 


S 


t 


+ d-x2)st.L 


again  as  a weighted  average  of  observation  and  previous  pre- 
diction. Finally,  trend  estimates  are  prepared: 


\ - 


X3(Xt  - 


xt-l>  + U-x3)Ac.1 


as  a weighted  average  of  the  observed  and  previously  predicted 
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trends.  Using  the  decomposed  indices,  forecasts  for  future 
periods  can  be  prepared.  A forecast,  X M , made  in  period 
t for  period  t+N  (i.e. , N periods  away)  can  be  recomposed 
using  the  equation 


Xt,N  “ <Xt  + NVS.:-L+N 


Several  observations  can  be  made  regarding  this  exten- 
sion to  the  basic  extrapolative  models.  First,  the  smooth- 
ing constants  (A's)  retain  their  interpretation:  larger 
values  place  high  weight  on  recent  observations  and  allow 
rapid  response  to  changes  in  data,  while  smaller  values 
provide  slower  response  and  thus  greater  stability.  Choices 
of  values  for  the  X's  thus  again  reflect  the  underlying 
trade-off  described  in  Section  2.  Secondly,  little  stor- 
age is  required  for  use  of  this  model  despite  its  extended 
coverage.  Only  the  current  observation,  the  current  esti- 
mates of  the  permanent  component  and  trend,  and  one  cycle's 
work  of  periodic  indices  are  required  for  use  of  the  model. 
Rapid  updating  and  forecasting  are  thus  facilitated. 


4.  Recent  Advances  in  Extrapolative  Modelling 

The  classical  methods  described  above  have  recently 
been  extended  by  the  developments  of  Box  and  Jenkins  and 
their  associates.  Their  contribution  consisted  basically 
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of  the  development  of  efficient  estimation  techniques  for 
a general  class  of  time-series  models  now  familiarly  known 
as  ARIMA  (auto-regressive  integrated  moving  average)  models. 
The  general  application  of  these  models  is  seen  from  a 
consideration  of  the  nature  of  time-series  data.  As  sug- 
gested in  Section  2,  an  observation  X can  be  considered  to 
have  both  a premanent  (stationary)  component,  a,  and  a ran- 
dom (stochastic)  component,  u: 

X = a + u . 

The  ARIMA  models  are  developed  from  a consideration  of  how 
a series  of  observations  X-^ , ...,  X^  can  be  generated  from 
such  a process.  First,  like  the  simple  exponential  smooth- 
ing model  of  Section  2,  each  observation  could  be  related 
to  the  previous  observarions  by  a linear  model  of  the  form 

X,  =o(0  +<^X  +0*2Xt-2  + + u’ 

or,  in  finite  form, 

X_  =a/n  + X.  , +ot~x  0 X„  + u . 

t 0 1 t-1  2 t-2  ^p  t-p 

The  above  form  is  referred  to  as  auto-regressive  in  that  the 
equation  resembles  the  regression  models  described  in  Section 
1,  with  the  dependent  variable  X,  expressed  as  a function  of 
its  own  past  values  X.  , , ....  X. 

L — X u *"p  . 
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A second  possible  way  in  which  the  series  could  be 
generated  resembles  the  moving  average  process  described 
earlier.  The  actual  random  disturbance  affecting  the  t*"*"1 
observation  could  be  written  as  a moving  average  of  the  form 


v = v,.  +o^,  V , + <*'9V1_  9 + . ..  +0/  V 
t t 1 t-1  2 t-2  ’ q t-q, 


where  the  V's  represent  independent  disturbances.  The  U's, 
however,  are  correlated  over  time  and  resemble  a moving 
average . 


Combining  the  above  two  models  yields  a process  of  the 


form 


XC  -*0  ♦‘Vc-l  + 


+ Vt  + M-1  + +/q\-q  • 


which  is  referred  to  as  a mixed  auto-regressive  moving  average 
process  of  orders  p and  q (or  ARIMA  (p,q)). 

The  final  extension  employed  in  their  model  can  be 
related  to  the  discussion  of  trends  in  Section  3.  Since 
most  time-series  are  not  stationary,  and  violate  the  simple 
form  x = a + u upon  which  the  above  discussion  was  based, 
one  can  employ  differencing  operations  to  allow  the  series 
to  move  freely.  If,  for  example,  a simple  trend  exists  in 


the  data,  first  differences 


will  be  stationary,  and  the  ARIMA  general  model  can  be  ap- 
plied to  them.  The  general  ARIMA  model  of  orders  p,  d,  and 
q is  thus  merely  the  ARIMA  model  of  orders  p and  q applied 
to  the  dL  differences  of  the  original  series.  (Note:  the 
other  regularity  discussed  in  Section  3,  seasonality,  may  be 
considered  a special  case  of  the  correlative  model  described 
above . ) 

The  ARIMA  (p,d,q)  model  thus  represents  a most  general 
candidate  for  extrapolative  forecasting,  one  which  encom- 
passes a broad  spectrum  of  underlying  processes  and  allows 
great  flexibility  in  terms  of  fitting  time-series.  Two 
statistical  problems  are  required  to  be  confronted  in  the 
use  of  such  a model:  the  selection  of  the  orders  (p,d,q) 
of  the  process , and  the  subsequent  estimation  of  the  model 
parameters.  Standard  statistical  programs  have  been  devel- 
oped for  both  phases  of  the  analysis,  and  can  be  routinely 
employed  to  yield  forecasting  equations.  These  programs 
additionally  yield  statistics  regarding  goodness  of  fit  and 
forecast  accuracy.  The  choice  between  these  methods  and 
those  described  earlier  is  basically  one  of  cost  versus 
effectiveness:  correctly  applied,  these  models  will  cer- 

tainly outperform  the  simpler  ones;  initial  estimation 
(and  to  a lesser  degree,  regular  use)  will  be  more  time 
consuming. 


5. 


Conclusions 


As  can  be  seen  from  the  foregoing,  the  thrust  of 
developmental  efforts  in  time  series  forecasting  has  been 
to  improve  reliability  by  incorporating  such  factors  as 
underlying  causal  variables,  seasonality,  trend,  non- 
stationarity  and  serial  correlation.  The  improvements  to  be 
achieved  by  such  modifications  will  naturally  depend  on 
the  particular  series.  Their  contribution  in  general  be- 
comes critical  when  long  range  forecasts  are  required. 

The  cost  of  achieving  such  improvements  lies  in  the 
added  complexity  of  the  calculations  and  additional  data 
storage.  It  should  be  noted  however,  that  most  of  this 
added  cost  is  the  one  time  cost  of  the  analysis  necessary 
to  derive  the  appropriate  model.  The  increases  in  computer 
storage  and  running  time  are  usually  small.  For  example, 
a simple  one  stage  exponential  smoothing  model  could  be 
substantially  improved  by  a one  time  analysis  designed  to 
provide  an  initial  multi-stage  estimate  and  appropriate 
weighting  constants.  As  is  shown  in  Section  2,  this  improve- 
ment can  be  incorporated  in  subsequent  model  updates  without 
additional  calculations  or  storage  requirements.  Likewise, 
incorporation  of  seasonality  and  trend  requires  a similar 
one  time  analysis  but  adds  only  two  additional  variables 
to  the  data  base. 

Where  only  a few  series  are  of  interest  it  is  generally 
desirable  to  proceed  to  the  more  sophisticated  Box-Jenkins 


forms  since  the  increment  in  regular  running  time  and 
storage  is  small  while  the  improvement  in  reliability  can 
be  great.  In  the  case  at  hand,  however,  where  approximately 
20,000  series  will  be  estimated  regularly,  even  a small  in- 
crement of  either  running  time  or  storage  assumes  unusual 
significance.  It  would  therefore  be  prudent  to  demonstrate 
that  significant  improvements  are  achievable  before  proceed- 
ing to  more  sophisticated  modeling. 

I would  recommend  that  such  one-time  analyses  proceed 
as  follows.  For  a small  subset  of  the  series,  determine 
whether  the  current  model  could  be  improved  without  further 
cost  in  running  time  or  storage.  Second,  analyze  these 
series  for  seasonality  and  trend  and  determine  the  additional 
costs  and  benefits  of  incorporating  them.  Third,  determine 
the  appropriate  Box-Jenkins  model,  estimate  parameters  for 
these  series,  and  again  calculate  the  costs  and  benefits 
of  using  them. 

Finally,  I would  recommend  that  consideration  be  given 
to  conducting  a cross-series  analysis  to  determine  how  these 
series  relate  to  one  another.  It  would  be  very  surprising 
if  many  of  these  series  were  not  functionally  related.  A 
very  recent  development  in  time  series  analysis  which  ii 
not  addressed  in  this  paper  permits  estimation  of  a series 
as  a function  of  other  series.  The  objective  of  such  analysis 
would  be  to  determine  if  more  reliable  forecasts  could  be 
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achieved  by  Che  use  of  sophis Licaccd  models  on  a few  scries 


which  could  then  be  extrapolated  to  the  rest.  My  intuition 
is  that  this  would  prove  to  be  the  case  and  that  it  would 
provide  the  best  long  term  solution. 
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APPENDIX  G 

SMOOTHING  CONSTANT  CONTROL 
BY  MONTH  OF  HISTORY 


INTRODUCTION 

In  previous  versions  of  QFAC,  the  parameter  "MONTH"  in  the  SC  CON 
user  control  file  was  used  to  specify  an  accession  cohort.  When  the  user 
wanted  to  adjust  smoothing  constants  for  a particular  calendar  month(s) 
of  history,  he  had  to  determine  the  affected  month  of  service  for  each 
available  accession  cohort  and  adjust  the  smoothing  constant  separately 
for  each  accession  cohort. 


This  requirement  is  eliminated  in  ELIM-IV.  The  user  can  now  specify 
an  adjustment  to  the  smoothing  constant  by  calendar  month (s)  which  will 
automatically  be  applied  over  all  relevant  accession  cohorts. 


CARD  INPUT  SPECIFICATIONS 

To  use  this  expanded  capability,  a new  set  of  card  images  should  be 
inserted  between  the  "*SC  CON"  and  "9999"  cards.  This  (optional)  deck 
will  start  with: 

*HIST  MON 

and  will  end  with: 

9998 

In  between  will  be  a set  of  cards  in  the  usual  form  to  specify  the 
following: 


• "LTYPE",  followed  by  a card  with  the  loss  type  or  "ALL" 

• "MONLO",  followed  by  a card  with  the  starting  calendar  month 

for  the  modification,  in  YYMM  format 


• "MONHI",  followed  by  a card  with  the  ending  calendar  month  for 
the  modification,  in  YYMM  format 


• "MONI",  followed  by  a card,  in  YYMM  format,  which  will  set  MONLO 
and  MONHI  to  the  same  calendar  month. 


• "SCVAL",  followed  by  a card,  in  F10.8  format,  that  defines  a 

multiplier  for  the  smoothing  constant.  This  multiplier 
is  initialized  to  1.0  by  the  program 

• "DO",  which  calls  the  modifications  defined  by  the  above 


There  is  no  limit  on  the  number  of  cards  in  this  deck.  The  user  should 
also  note  that  there  is  no  resetting  of  the  controls  after  a "DO",  so  each  of 
LTYPE,  MONLO,  MONHI  and  SCVAL  will  remain  unchanged  until  overridden. 
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APPENDIX  H 

REVISED  EQUATIONS  IN  THE  MATRIX  GENERATOR 
FOR  THE  EXPANDED  C-GROUPS 


INTRODUCTION 

This  appendix  displays  the  equations  in  the  Matrix  Generator  which 
have  been  modified  for  the  expanded  C-group  capability.  The  development 
involved  the  treatment  of  terms  of  enlistment  3 and  4 and  terms  of  enlist- 
ment 2,  5,  and  6 as  two  distinct  categories.  The  equations  were  modified 
to  accommodate  the  number  of  C-groups  designated  for  combined  terms  3 and 
4 (number  of  C-groups  denoted  NCG34)  and  the  number  of  C-groups  designated 
for  combined  terms  2,  5,  and  6 (number  of  C-groups  denoted  NCG256) . New 
equations  were  developed  to  reflect  upper  and  lower  limits  pertaining  to 
the  combined  terms  of  service  (for  example  to  constrain  first  term  enlist- 
ments, terms  of  service  3 and  4 years,  to  be  no  less  than  a specified 
fraction  of  total  enlistment);  new  variables  and  user  tables  were  also 
developed  to  reflect  C-group  definition  and  treatment  of  the  two  term  of 
enlistment  categories. 

The  Matrix  Generator  equations  in  C0MPLIP-G2  were  "hard-wired"  to 
"track"  first-term  enlistees  for  21  months.  The  Matrix  Generator 
equations  have  been  modified  to  allow  for  a user-specified  number  of 
"tracking  months,"  the  maximum  number  of  months  being  54. 


All  revised  Matrix  Generator  equations  have  been  entered  into  the 
source  code. 


C-GROUP  MODIFICATIONS  AND  CHANGE  IN  TRACKING  MONTHS  FROM  21  TO  tt,  WHERE  tt  IS  A USER-SPECIFIED  INPUT 
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APPENDIX  I 


MODIFICATIONS  TO  THE  NON-PRIOR  SERVICE  GAINS  (NPSG)  MODULE 

This  appendix  presents  a brief  resume  of  the  ELIM-III  Non-Prior  Service 
Gains  (NPSG)  Module  and  indicates  the  enhancements  undertaken  to  create  an 
NPSG  module  compatible  with  ELIM-IV  standards. 

ELIM-III  NPSG  MODULE 

The  ELIM-III  NPSG  Module  consists  of  the  following  components: 

e NPSG  Frequency  File:  the  basic  NPS  accession  information 

e UPDATE:  the  NPSG  Frequency  File  updating  program 

e PREPRO:  the  program  interpreting  and  implementing  user 

instructions 

e BMD:  the  regression  package  creating  accession  projections 
e HENRY:  the  graphical  analysis  program 
e AGGRE:  the  C-group  aggregation  program 

ELIM-IV  MODIFICATIONS 

NPSG  Frequency  File 

The  ELIM-III  NPSG  Frequency  File  is  divided  into  two  partitions. 
Partition  1 provides  the  following  192  disaggregations  of  male  accessions. 

• Age 

Less  than  18 
18,  19 
20,  21 

- Greater  than  21 

• Race 

- Black 

- Non-black 


1-1 


U 


U 


ii 


u 


li 


Civilian  education 

Diploma  high  school  graduate  (DHSG) 
General  education  development  (GED) 
Some  high  school 
No  high  school 
Mental  group 

I,  II,  IIIA 

IIIB 

IV,  V 

Enlistment  bonus 
Yes 
No 


Female  accessions  are  assumed  to  be  mental  group  I or  II  high  school 
graduates  (either  diploma  or  GED)  receiving  no  enlistment  bonus  and  are 
disaggregated  into  four  categories,  as  follows: 


Civilian  education 
DHSG 
GED 

Race 

Black 

- Non-black 


! 


Partition  2 of  the  ELIM-III  frequency  file  disaggregates  males  by 
term  of  service  into  the  following  94  categories: 


Age 


All  ages  combined 


Race 


-It 


• Mental  group 

I,  II,  IIIA 

IIIB 

IV,  V 

• Term  of  enlistment 

- 2 years 
3 years 

- 4,  5,  6 years 

The  ELIM-IV  NPSG  Frequency  File  eliminates  the  two-partition  concept 
and  disaggregates  accessions  into  768  categories,  which  are  all  combinations 
of  levels  of  the  following  factors: 

• Age 

- Less  than  18 
18,  19 

20,  21 

- Greater  than  21 


- Black 

- Non-black 
Civilian  education 


Not  DHSG  (NHS) 
Mental  group 

I,  II,  IIIA 

IIIB 

IV,  V 


- Female 

Term  of  enlistment 

- 2 years 

- 3 years 

- 4 years 

- 5,  6 years 


Enlistment  bonus 
Yes 
No 


The  ELIM-IV  NPSG  Frequency  File  is  an  IRIS-compatible  direct  access 
file  configured  to  contain  84  months  of  data  beginning  with  January  1972. 
The  ELIM-III  Frequency  File  was  a sequentially  organized  disk  file  capable 
of  containing  72  months  of  data  beginning  with  February  1972. 

UPDATE:  The  NPSG  Frequency  File  Updating  Program 

Due  to  the  wholesale  restructuring  of  the  NPSG  Frequency  File,  a 
completely  new  version  of  UPDATE  has  been  written  to  maintain  the  ELIM-IV 
file. 

PREPRO:  User  Instructions 

Under  the  ELIM-III  system,  aggregation  of  individual  population  cells 
to  form  dependent  variables  is  accomplished  by  listing  the  cell  numbers  to 
be  aggregated  for  a particular  variable  on  input  cards.  This  system  is 
very  tedious  for  the  196  cells  used  in  the  ELIM-III  Partition  1 but  alto- 
gether too  cumbersome  for  the  768  cells  used  in  the  ELIM-IV  data  file. 

Because  of  this  unwieldy  input  requirement  and  a desire  for  strict 
editing  of  user  specifications,  a completely  prompted  interactive  front-end 
program  (QUERY)  was  developed.  QUERY  permits  dependent  variables  to  be 
defined  by  entering  specifications  as  indicated  in  Exhibit  1.1. 


r 

li 


Additionally,  all  other  information  required  for  the  execution  of  the 
ELIM-IV  NPSG  Module  is  prompted  for,  edited,  and  recorded  for  use  by  the 
component  programs  of  the  module. 

The  ELIM-III  version  of  PREPRO  has  been  extensively  modified  to  accept 
as  input  the  card-image  file  created  by  QUERY  rather  than  a user-created 
card  deck. 


I 

l 
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BMD : The  Regression  Package 


The  BMD  regression  package  is  essentially  unchanged.  With  the 
exception  of  several  small  improvements  in  efficiency,  the  only  enhance- 
ment provided  in  the  ELIM-IV  BMD  is  the  capability  of  producing  forecasts 
for  as  many  as  89  months  past  the  end  of  historical  data. 

HENRY:  The  Graphical  Analysis  Program 

The  graphical  analysis  program  is  unchanged  except  that  forecasts 
for  months  which  cannot  be  accommodated  on  the  maximum  graph  width  are 
not  graphed  at  all. 

AGGRE:  The  C-Group  Aggregation  Program 

The  C-group  aggregation  program  is  unchanged  except  that  C-group 
aggregations  can  now  be  provided  to  the  IPM  as  well  as  to  COMPLIP  through 
the  Matrix  Generator.  The  maximum  number  of  C-groups  which  can  be  passed 
to  the  IPM  or  the  Matrix  Generator  has  been  increased  to  40  to  satisfy  the 
requirements  of  the  C-group  expansion  task. 


Exhibit  1.1 


iPECIFY  DESIRED  AG5REGAT I QMS  BY  ENTERING  IN  THE  INDICATED  = I ELDS 
C INVARIABLE  NAME  AND 

<2.*  CODES  INDICATING  CHARACTERISTICS  TO  BE  CONBINED  CLEFT- JUSTIFIED:* 
■.JLTIPLE  CODES  FOR  A CHARACTERISTIC  ARE  ENTERED  IN  A SERIES  WITHOUT 
DELIMITERS. 


; 4ARACTERISTIC  CODE 


DEFINITION 


RACE 


EDUCATION 


ENTAL  GROUP 


LESS  THAN  13 
13*  19 
20*  SI 

MORE  THAN  21 
ALL  OF  THE  ABOVE 

BLACK 

NON-BLACK 

ALL  OF  THE  ABOVE 

DHSG 
NHS  * GED 

ALL  OF  THE  ABOVE 

I*II*IIIA 
1 1 IB 
IV 

All  OF  THE  ABOVE 


MALE 

FEMALE 

ALL  OF  THE  ABOVE 


TERM 


BONUS 


5>G  YEARS 
£ YEARS 

3 YEARS 

4 YEARS 

ALL  OF  THE  ABOVE 

YES 

NO 

ALL  OF  THE  ABOVE 


TRY  OF  ” ♦FORMAT”  FOR  "NAME"  WILL  REPRODUCE  FORMATTING  HEADER 

CODES 


NAME 


R 

A A 

5 S E M 
E E D 5 


FEMALES 


? ? ? 
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